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Abstract: 



We present a determination of the parton distributions of the nucleon from a global set 
of hard scattering data using the NNPDF methodology at LO and NNLO in perturbative 
k> I QCD, thereby generalizing to these orders the NNPDF2.1 NLO parton set. Heavy quark 

;h ' masses are included using the so-called FONLL method, which is benchmarked here at 

NNLO. We demonstrate the stability of PDFs upon inclusion of NNLO corrections, and 
we investigate the convergence of the perturbative expansion by comparing LO, NLO and 
NNLO results. We show that the momentum sum rule can be tested with increasing ac- 
curacy at LO, NLO and NNLO. We discuss the impact of NNLO corrections on collider 
phenomenology, specifically by comparing to recent LHC data. We present PDF determi- 
nations using a range of values of a^, rUc and m^. We also present PDF determinations 
based on various subsets of the global dataset, show that they generally lead to less accu- 
rate phenomenology, and discuss the possibility of future PDF determinations based on 
collider data only. 
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1 Introduction 

In a series of previous papers [IHS]) we have presented a novel methodology for the de- 
termination of parton distributions which strives to minimize parametrization bias and 
ad hoc statistical assumptions by using a Monte Carlo approach with neural networks as 
unbiased underlying interpolating functions. The statistical consistency of this approach 
was confirmed in Ref. [7j by showing explicitly that it yields a probability distribution of 
PDFs which, upon the inclusion of new data, behaves in accordance with Bayes' theorem. 
In Ref. [8] this NNPDF methodology was used to determine a set of parton distributions 
based on a global dataset, using NLO QCD, with inclusion of heavy quark mass effects. 
This PDF set, called NNPDF2.1, is arguably the most accurate NLO PDF set currently 
available, from every point of view: dataset, theoretical treatment, and statistical method- 
ology. It has been made available for a variety of values of the strong coupling and of the 
heavy quark masses. 

In this paper, we provide companion PDF sets based on the same methodology and 
data, but now using LO or NNLO theory: NNPDF2.1 LO and NNPDF2.1 NNLO. Both 
are needed for collider phenomenology: the LO parton distributions are principally for use 
with LO Monte Carlos, while the NNLO sets are needed for evaluation of LHC standard 
candle processes, some of which (such as Higgs production) are characterized by large 
NNLO QCD corrections and are either measured or measurable to an accuracy which 
may be comparable to the size of NNLO effects. 

The full theoretical framework that is necessary in order to construct NNLO (and of 
course LO) PDFs is already available. There are however several implementation issues 
which must be dealt with. At LO, parton distributions can be interpreted as probability 
distributions, and they are therefore non-negative: to ensure this, it will be advantageous 
to introduce a modification of the neural network parametrization of Ref. [6] such that 
positivity is hard-wired. Also, it has been suggested [U1[TU] that it may be useful to relax 
the momentum sum rule at leading order, and use a next-to- leading order form of the 
strong coupling in the determination of LO PDFs. All these issues will be investigated. 

As we move now to NNLO, we have to address the issue of implementing higher order 
corrections in a numerically efficient way. In Ref. |6j we developed a method, dubbed 
FastKernel, for the inclusion of NLO corrections to parton evolution and to the computa- 
tion of deep-inelastic (DIS) and Drell-Yan (DY) observables, without the use of iC-factors. 
Here, we will use the same method for the computation of evolution and DIS to NNLO. 
Heavy quark mass effects will be included using the so-called FONLL method, first de- 
veloped for hadronic processes in Ref. [H] and extended to DIS in Ref. [12] : the imple- 
mentation of the FONLL method up to NNLO (called FONLL-C in Ref. [I2]) requires 
the computation of some hitherto unknown Mellin transforms, and its implementation in 
the FastKernel framework must be benchmarked. For Drell-Yan we will rely on the NLO 
FastKernel implementation of Ref. [12], with NNLO corrections to it included by means 
of if -factors (note that in other global PDF fits such as Refs. [13^1141 both NLO and 
NNLO corrections to Drell-Yan are included using if -factors). This computation of the 
Drell-Yan process to NNLO will also be benchmarked. For the inclusive jet cross-section 
we will employ the threshold approximation to the NNLO corrections, since exact results 
are as yet unknown: these will be implemented using the FastNLO code [15J. 

While we will refer to our previous papers for a general introduction to the NNPDF 



methodology and for a detailed description of the NNPDF2.1 NLO PDF set, here we will 
document all the new issues that arise in the determination of LO and NNLO PDFs, 
specifically those mentioned above. With LO, NLO and NNLO results at our disposal, 
we will be able to investigate the perturbative stability of PDFs. We will thus be able 
to show that for PDFs in the kinematic range currently accessible the convergence of the 
perturbative expansion is very good: in particular NNLO PDFs are quite close to NLO 
ones. In particular, we will perform a study of the momentum sum rule at LO, NLO 
and NNLO, based on PDF determinations in which the sum rule is not imposed as a 
constraint and check that indeed the sum rule follows from the experimental data. We 
will then perform some phenomenological NNLO studies, in particular for LHC standard 
candles. Finally, we will discuss, in the context of the NNLO determination — which is 
theoretically the most accurate — the dependence of results on the value of the strong 
coupling and the size of the dataset, which are the main potential sources of uncertainty. 
The outline of this paper is the following. In Sect. [2] we present the experimental data 
used in the analysis: these only differ from those used in the NNPDF2.1 NLO determi- 
nation of Ref. [8] in that the inclusion of NNLO heavy quark corrections allows for looser 
kinematic cuts on charm structure function data. In Sect. [3] we summarize our computa- 
tion of all NNLO physical observables that enter the PDF fit, and specifically discuss the 
NNLO heavy quark mass implementation, and the implementation of NNLO corrections 
to the Drell-Yan process. Mellin transforms of the NNLO heavy quark coefficient functions 
are given in Appendix |X1 In Sect. [J] we discuss modifications to the PDF parametriza- 
tion and minimization which have been performed at LO and NNLO, in particular to 
optimize the requirement of positivity at LO, and to obtain accurate minimizations at 
LO and NNLO. The NNPDF2.1 LO and NNPDF2.1 NNLO sets are presented in Sect. 
and Sect. [6] respectively, where they are also compared to other available PDF sets. In 
Sect. [7] we examine the convergence of the perturbative expansions for individual PDF 
flavours, and perform a precision determination of the momentum carried by quarks and 
gluons in the nucleon. The implications of NNPDF2.1 NNLO PDFs for LHC phenomenol- 
ogy are reviewed in Sect. [HI where, after discussing the relevant parton luminosities, we 
present predictions for LHC standard candles and compare them to the LHC data which 
are available at present. We finally turn in Sect. [9] to the issues of the dependence of 
results on the value of the strong coupling, and the size of the dataset, which we will 
study by constructing PDFs based on various subsets of data (HERA only, DIS only, col- 
lider only, DIS+Drell-Yan). Technical details on the implementation and benchmarking 
of DIS structure functions and NNLO PDF evolution are collected in Appendices |B] and 
ICl respectively. 
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Table 1: Charm structure function datasets included in the NNPDF2.1 NNLO analysis. All other 
data are the same as in the NNPDF2.1 NLO analysis, given in Table 2 of Ref. [8]. The number of 
data points after kinematic cuts are shown in parentheses. In the last three lines we give the total 
number of datapoints included in the NNPDF2.1 LO, NLO and NNLO fits. 

2 Experimental data 

The experimental data on which the LO and NNLO PDF sets are based are the same as 
those used for the NNPDF2.1 NLO set of Ref. [8] and discussed there, with some minor 
differences in data and kinematic cuts which we discuss here. 



2.1 Data sets 

The NNPDF2.1 NLO dataset includes NMC |23|l2i] . BCDMS [251126] and SLAG [27] deep- 
inelastic scattering (DIS) fixed target data; the combined HERA-I DIS dataset [28], HERA 
Fl [29] and F| structure function data [MSS], ZEUS HERA-II DIS cross-sections \mm. 



CHORUS [32] inclusive neutrino DIS, and NuTeV [33y34] dimuon production data; fixed- 
target E605 ^ and E866 [HeHSg Drell-Yan production data; CDF [39j W asymmetry 
and CDF [IQ] and DO [H] Z rapidity distributions; and CDF [l2] and DO [33] Run-II 
one-jet inclusive cross-sections. A scatter plot of these data in the x, Q^ plane is displayed 
in Fig. [H with the values of x determined using LO kinematics. 

In the NNPDF2.1 LO fit, the dataset is modified in comparison to the NLO dataset 
in that the Fl structure function data are removed, since this observable vanishes at LO. 

In the NNPDF2.1 NNLO fit, the dataset is modified in two respects in comparison 
to the NLO dataset. First, the E866 data, published as xp distributions, have been con- 
verted into rapidity distributions, since the use of rapidity as kinematic variable makes the 
inclusion of NNLO corrections simpler. This was done following the procedure discussed 
in Ref. [44] . by using the average px of the lepton pair in each bin. We have verified 
explicitly that if this procedure is also applied at NLO, the fit results are unchanged. 
Second, the NMC proton data are now included as data for reduced cross-sections, rather 
than for structure functions. It was shown in Ref. [l5] that the impact of this different 
treatment is almost negligible at NLO. However the use of cross-section data is in principle 
preferable, as they are closer to what is actually measured. In Ref. [3U] it was claimed 
that the treatment of these data may have a significant impact on NNLO PDFs, though 
this claim is not supported by preliminary investigations with NNPDF2.1 NNLO [47] . or 
with MSTW08 |l9] PDFs. 
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Figure 1: The experimental data which enter the NNPDF2.1 PDF determination with NNLO 
kinematic cuts. 



2.2 Kinematic cuts 

All data included in the NNPDF2.1 LO, NLO and NNLO fits are subject to cuts on the 
invariant mass W'^ and the scale Q^ of the DIS final state W^^^ > 12.5 GeV^ and Q^ > 3 
GeV^. In the NLO fit, the F^ data were subject to the further cuts Q^ > 4 GeV^ and 
Q^ > 10 GeV'^ if X < 10~^, due to the fact that in this region NNLO massive corrections 
are so large that a NLO approximation is not acceptable. These cuts will be removed for 
the NNLO fit, in which the Fg data will only be subject to the cuts which are common to 
all other DIS data. The LO fit instead will use the same cuts as the NLO one. The charm 
structure function data included in the NNLO fit are listed in Tabled) all other data are 
the same as in the NLO fit, Table 2 of Ref. [8]. The total numbers of datapoints used at 
LO, NLO and NNLO are also given in Tabled) 



3 Physical observables 

As mentioned in the introduction, there are two aspects of the theoretical implementation 
of QCD corrections which require some discussion away from NLO: the treatment of heavy 
quark masses, and the fast implementation of NNLO corrections to hadronic observables 
and the corresponding benchmarking. The former will be discussed in Sect. 13.11 both at 
LO and NNLO, while the analytic expression of various hitherto unknown NNLO coeffi- 
cient functions in Mellin space are listed in Appendix lAl the general-mass deep-inelastic 
coefficient functions will then be benchmarked in Appendix[Bl The latter will be discussed 
and benchmarked in Sect. 13.21 Perturbative evolution at NNLO will be benchmarked in 
Appendix O Given that for jets we will rely on the FastNLO code [15j, this provides us 
with a full benchmarking of all NNLO expressions used. 

3.1 LO and NNLO structure functions with heavy quark mass effects 

We will include heavy quark masses using the FONLL method of Ref. [11], extended to 
DIS in Ref. [12] (see Ref. [8] for charged-current DIS expression). In the NNPDF2.1 
NLO fits [8] we used the FONLL-A scheme, which combines NLO massless perturbative 
evolution with the O (as) massive coefficient functions. 

At LO, the massive neutral-current coefficient function vanishes, and thus, for neutral 
current DIS, the FONLL scheme of Ref. [12] only differs from a naive zero-mass scheme 
by a damping factor which suppresses dynamically generated charm contributions below 
threshold. The same is true for charged-current DIS when the heavy quark is the struck 
quark, while for contributions in which a heavy quark is produced from a struck light quark 
the FONLL expression simply reduces to the parton-model {0{a^)) massive coefficient 
function. 

At NNLO, it is possible to combine NNLO massless perturbative evolution with the 
0(0^) massive coefficient functions: this was called the FONLL-C scheme in Ref. [12]. It is 
also possible to instead combine the 0{a'^) massive coefficient functions with NLO massless 
perturbative evolution, which was called FONLL-B scheme in Ref. [12j; a comparison of 
FONLL-B with FONLL-A for NLO fits will be performed elsewhere. 

Different schemes for the inclusion of the heavy quark mass in DIS structure functions 
were benchmarked in Ref. [38] , with common input toy PDFs and common choices of all 
other settings, such as the values of the heavy quark masses. Preliminary comparisons 
of FONLL-C with S-ACOT-x at NNLO [50] suggest that the two GM-VFN schemes are 
numerically similar. 

The charm structure functions F2^c and Fl^c computed in the FONLL-C scheme are 
shown in Fig. [2] and Fig. [3] respectively. They are compared to the NNLO determination of 
the structure function in which the heavy quark mass is neglected (ZM-VFN, or zero-mass 
variable-flavour number scheme), and to the O(a^) computation in a fixed nj = 3 scheme 
with charm mass (FFN). It is clear from the plots that the FONLL-C scheme interpolates 
smoothly between the O (a^) massive scheme (FFN) near the heavy quark threshold, and 
the O (a^) massless scheme (ZM-VFN) at large Q^. Mass effects are much larger for the 
longitudinal structure function F^^c than for -F2,ci so there the ZM-VFN computation is 
completely unreliable. 

In the sequel we will adopt FONLL-C with threshold damping factor ^12j as our default 
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Figure 2: The charm structure function F^^Ax^Q^) as a function of Q^ for different values of x 
from X — 10^^ to a; = 10~^ computed in the FONLL-C scheme, and compared to the zero mass 
(ZM-VFN) and fixed-flavour number (FFN) results. The reference input PDFs and settings of Les 
Houches benchmarks [15] are used throughout. 



choice for the NNPDF2.1 NNLO fit. The heavy quark masses will also take the same 
default values as in the NNPDF2.1 NLO fit, namely rric = 1.414 GeV and m?, = 4.75 GeV. 
These should be taken as pole masses, because this choice is adopted in the construction 
of the FONLL-B and FONLL-C expressions given in Ref. [1^. The use of running MS 
masses has been advocated recently [51] because of the greater perturbative stability of 
the running mass: this possibility will be studied in future NNPDF releases. 

All the above discussion applies to neutral-current structure functions. In the case 
of charged-current DIS, a full implementation of the FONLL-C scheme is not possible, 
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Figure 3: Same as Fig. [2]for the longitudinal charm structure function Fl,c{x, Q^)- 



because the massive 0{a1) heavy quark coefficient functions are not available (only the 
asymptotic Q^ — ?• 00 limit is known [52]). Consequently, in the FONLL-C charged current 
structure functions the 0{a1) massive contribution is set to zero, while PDFs, the ZM 
structure functions and a^ are evaluated at NNLO. We have checked that the impact of 
NNLO corrections in the charged current sector is very moderate, typically well below 
10%. This choice achieves the best accuracy that can be obtained from the available 
perturbative information without introducing any modelling. 

All NNLO structure functions have been implemented in the FastKernel framework of 
Ref. [6J. The benchmarking of the numerical accuracy of the implementation is discussed 
in Appendix [Bl 
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Figure 4: The NNLO/NLO iiT-factors for the Dreh-Yan, W and Z production data included in 
the NNPDF2.1 fit. 



3.2 The treatment of hadronic data 

We now turn to the NNLO implementation of the hadronic data, namely, Drell-Yan, W 
and Z production, and inclusive jets. 

For NNPDF2.1 NLO, Drell-Yan and vector boson production were treated consistently 
at NLO in perturbative QCD in all the stages of the PDF fit using the FastKernel frame- 
work [6j. The extension of the FastKernel method to NNLO is in principle straightforward, 
but in practice challenging, in particular because of the distribution structure and intricate 
choice of kinematic dependence of the NNLO coefficient functions of Ref. [53]. Therefore, 
here we instead adopt an approximate NNLO computation, which leads to an accuracy 
which is fully adequate for our purposes as we shall now show. 

In this approximation, Drell-Yan observables are computed with NNLO PDF evolution 
and NLO partonic cross-section supplemented by a if -factor that accounts for the missing 
0(0^) partonic coefficient functions. The if- factors are defined as the ratio of double 
differential cross-sections cPa/dydA-'I^ in Drell-Yan production where in the numerator 
we use the full NNLO expression and in the denominator the same expression but with 
the O (a^) correction to the partonic cross-section set to zero. In this definition the same 
NNLO PDFs and «« are used both in the numerator and the denominator. This minimizes 
the impact of the NNLO if -factor corrections, which are then reduced to the missing 
O (og) partonic cross-sections. These are rather small for most processes of interest, 
especially for collider kinematics. 

We have computed these if -factors using the VRAP code ^53j , and cross-checked the 
results with the DYRAP [5l] program. The A'-factors are computed iteratively using 
central PDFs from a previous NNPDF2.1 NNLO fit. Results are shown in Fig. [J] for 
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Figure 5: Percentage difference between the values of the K factors used in the NNPDF2.1 PDF 
fit (shown in Fig. 0]) and their redetermination with the final NNPDF2.1 PDFs. The error bar is 
obtained as one sigma uncertainty over replicas. 

the different datasets as a function of the rapidity y of the produced electroweak boson. 
For collider observables like W/Z production at the Tevatron the K-iactois are at the 
few percent level. The NNLO iiT-factors are more important for fixed-target Drell-Yan 
data, in particular for the E866 kinematics, where they are typically of order 10%, but 
sometimes as large as 25%. However, the average total experimental uncertainty on these 
data is larger than 20%. 

The error incurred replica by replica through the use of the i^-factor approximation 
is then subleading in the perturbative expansion. To see this, note that approximation in 
the computation of the K factor comes from its dependence on the PDF. However, the 
X-factor only enters at NNLO, while the NLO is exact, so the error in the PDF used to 
compute the iiT-factor is O [a'^)- However the i^T-factor itself is O (a^), so the error in 
the cross section is in fact O (af ) . Note that this is not the case if A'-factors are used for 
both NLO and NNLO corrections (as in Refs. |13p 4j). since then the error in the PDF is 
O (as), and thus O (a^) in the cross section, hence at least in principle of the same order 
as the NNLO correction to the cross section itself. 

In Fig. [5] we show the percentage shift of the X-factors if they are recomputed using 
the final NNPDF2.1 NNLO PDF set. In order to study the possible dependence of the 
accuracy of the K-iactor approximation on the choice of (central) PDF used to compute 
the i^-factors, we have repeated this comparison for ten different randomly chosen replicas. 
The standard deviation of the results is also shown as an error bar in Fig. [5j We conclude 
that the accuracy is always better than 3%, which translates into an uncertainty of no 
more than 0.7% in the cross-section. 
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For the inclusive jet production data exact NNLO corrections are not known. How- 
ever, an approximation to the fuh NNLO result based on threshold resuniniation is avail- 
able [55j. We will compute inclusive jet observables using an approximation on which 
PDFs are evolved at NNLO, but coefficient functions are computed using this threshold 
approximation of the full NNLO result, as implemented in the FastNLO code [15j. This 
provides us with an approximate NNLO calculation which combines the most accurate 
perturbative information available. Fits in which the jet data are evaluated with NLO 
coefficient functions, or simply removed altogether, will be discussed in Sect. 
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4 Implementation issues at LO and NNLO 

The parton parametrization, minimization algorithm, and determination of the optimal fit 
in the LO and NNLO fits presented here are the same as in the NLO PDF determination of 
Ref. [5] , including almost all settings for the parameters which control the parametrization 
and minimization. The small number of changes are discussed here: first, we describe how 
at LO the parton parametrization is optimized in view of LO positivity constraints, and 
then we examine some adjustments in the choices of parameters for the genetic algorithm 
and the stopping of the minimization. 

4.1 Parton parametrization: positivity constraints 

In all previous NNPDF fits, positivity of physical observables has been imposed: beyond 
leading order, PDFs depend on the factorization scheme and can be either positive or 
negative, however cross-sections must remain non-negative [56]. Positivity of physical 
observables at NLO was enforced by means of Lagrange multipliers (see Ref. [8]), and the 
same procedure will be used here for the determination of NNLO PDFs. 

However, at leading order parton distributions admit a probabilistic interpretation and 
are thus non-negative. The positivity constraint can then be imposed directly on all PDFs 
at the initial scale. Leading-order evolution preserves the probabilistic interpretation of 
PDFs |57y58j. hence this is sufficient to guarantee positivity at all scales. This can again 
be done by Lagrange multipliers, i.e. adding to the x^ a large penalty term whenever any 
of the individual PDFs turns negative. However, in order to speed up the LO PDF fits, 
it is advantageous to impose positivity directly at the level of the PDF parametrization. 
Within the neural network PDF parametrization which we adopt, this can be done as 
follows. Recall that in the architecure that we adopt for neural networks, the response 
function is a sigmoid 

el" = s(E4'r" -".'")• "W-TTF^- w 

for hidden layer, but it is linear in the last layer. For the LO fits, we adopt instead for the 
last layer a quadratic reponse function 



^i = E^'r -^r' ■ (2) 



The output of the neural network, and thus the PDF, is then guaranteed to be non- 
negative. 

The basis of PDFs that are parametrized by neural networks in NNPDF fits [11[5] 
includes the gluon, quark singlet, and various other linear combinations of quark PDFs. 
Of these, only the gluon and singlet must be positive, since all other combinations con- 
tain differences of PDFs. However, in practice also the total valence and isospin triplet 
combination are positive definite. Hence the parametrization Eq. ([T][2]) for simplicity is 
adopted for all PDFs: PDFs other than singlet, gluon, valence and triplet are allowed to 
change sign by simply adding to the above form a constant shift. It turns out that with 
the constraints from the data, this is sufficient in practice to ensure positivity of all PDFs: 
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Armut 


Tymax 


^sw 


-'^mut 


-'^mut 


LO & NLO 


10000 


2500 


30000 


2.6 


80 


10 


NNLO 


10000 


2500 


30000 


2.3 


80 


30 





LO & NLO 




NNLO 




PDF 


A^mut 


j^k 


iVmut 


^k 


S(x) 


2 


10,1 


2 


10,1 


9{x) 


2 


10,1 


3 


10,3,0.4 


Tsix) 


2 


1,0.1 


2 


1,0.1 


V{x) 


2 


1,0.1 


3 


8,1,0.1 


As{x) 


2 


1,0.1 


3 


5,1,0.1 


s+{x) 


2 


5,0.5 


2 


5,0.5 


s-{x) 


2 


1,0.1 


2 


1,0.1 



Table 2: Parameter values for the genetic algorithm for the NNLO fits compared to those of the 
LO and NLO fits (top). The number of mutations and the values of the mutation rates for the 
individual PDFs in the NNLO fit as compared to the values of the LO and NLO fits are also given 
(bottom). 

we have checked a posteriori that for every replica the gluon and all individual quark and 
anti-quark flavours are positive for all values of x and Q^ for which the NNPDF2.1 LO 
PDFs are provided. 

4.2 Minimization and stopping 

The poorer quality of the LO fit on the one hand, and the greater complexity of NNLO 
coefficient functions on the other hand, require some retuning of the parameters of the 
minimization algorithm. 

At leading order, the best-fit value of the figure of merit E^''^ which is being minimized 
for each replica (which is essentially the x^ of the fit of each PDF replica to the given data 
replica) is on average rather larger than in an NLO fit, because of the poorer accuracy of 
the LO theory. This is particularly true for the Drell-Yan observables, which have large 
NLO corrections with a if -factor of order two. As a consequence, the minimum value that 
E^ ' must reach for each experiment in order for the fit to stop has been increased from 
Eth = 6 to E^ = 12 for all Drell-Yan experiments. Furthermore, the cross-validation 
method that we use to determine the optimal fit stops the minimization when the moving 
average (over iterations of the genetic algorithm) of E^ ' increases more than a fixed 
percentage threshold value r^, larger than a typical random fluctuation. Because the size 
of fluctuations of E^ > remains fixed, while its value at best fit has increased, the typical 
values of r„ are smaller at LO, and thus it turns out to be necessary to reduce the value 
of r„ required for stopping to r„ — 1 = 2 • 10~^, from r„ — 1 = 3 • 10~^ used at NLO. 

Even with these adjustments, for a sizable fraction of replicas the cross-validation algo- 
rithm fails to stop dynamically the minimization even after a large number of generations 
of the genetic algorithm. This reflects the poor accuracy of LO theory, and it could only 
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be obviated by letting the genetic minimization run much longer. In view of the large 
theoretical uncertainties inherent to any LO PDF determination, as a practical compro- 
mise, we discard replicas that do not stop dynamically after 50000 iterations of the genetic 
algorithm, retaining only those replicas for which the stopping criterion was fulfilled. We 
have checked that this leads to no significant statistical bias. 

At next-to-next-to-leading order the partonic cross-sections have rather more structure 
than at lower orders, both because of the opening of new partonic channels and because of 
the appearance of new transcendental functions in the perturbative results (such as higher 
order harmonic sums). This results in somewhat more complex PDF shapes. As a conse- 
quence, it turns out to be necessary to increase the number of mutants and mutations per 
PDF in the genetic minimization in order to fully explore this more complex space of min- 
ima. The NNLO settings for the genetic algorithm used for minimization are summarized 
in Tableland compared to those used at LO and NLO. The table is to be compared to 
Table 6 of Ref. [6] and Table 5 of Ref . [JJ , to which we refer for a more detailed discussion. 
We give the number of mutants in the two stages in which the minimization is divided, 
the number of mutations per PDF, and the values of the mutation rates for each PDF. 
We also show in a separate table the number of mutations that are applied to each PDF 
(which at NLO was two for all PDFs), and the values of the mutation rates ry of each 
PDF (which at NLO were given in Ref. [3j and kept unchanged in Ref. [6]). 
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PDF set 


.LHgrid file 


as {Mz) 


Momentum SR 


NNPDF2.1 LO 
NNPDF2.1 LO 
NNPDF2.1 LO* 
NNPDF2.1 LO* 


NNPDF21.1o_as_0119.100.LHgrid 

NNPDF21.1o_as_0130.100.LHgrid 

NNPDF21Jostar_as_0119.100.LHgrid 

NNPDF21Jostar_as_0130.100.LHgrid 


0.119 
0.130 
0.119 
0.130 


Yes 
Yes 

No 

No 



Table 3: Summary of NNPDF2.1 Leading Order PDF sets. 

5 Leading order parton distributions 

Parton distributions based on a leading-order QCD treatment of the data are mostly used 
with leading order Monte Carlo event generators, and are also of interest for comparison 
of QCD calculations at different perturbative orders. Of course, nothing prevents the 
inclusion of some NLO terms in a calculation which has LO accuracy, so in principle one 
could always use NLO PDFs in these and related contexts. However, in practice using 
NLO PDFs with LO matrix elements may lead to a poorly behaved perturbative expansion 
and to bad phenomenology. Indeed, the difference between the optimal PDFs determined 
from a LO analysis at their standard NLO counterparts is typically rather larger than 
NLO PDF uncertainties. Hence, the dominant uncertainty on LO PDFs is theoretical, 
and there is a certain latitude in their definition. Therefore, we will at first discuss various 
options for the construction of LO PDFs, then turn to results and comparisons, with the 
statistical aspects of the PDF determination now playing a relatively less important role. 



5.1 Definition of leading order PDFs 

The issue of choosing the optimal parton sets to be used in combination with LO event 
generators has been discussed extensively. On the one hand, the possibility of using 
standard LO QCD theory (including the running of as) seems theoretically simplest and 
most consistent. On the other hand, it could be that this leads to unacceptably poor fit 
quality for some datasets included in the global fit and thus modifications of the standard 
LO framework should be considered. 

The simplest of these, advocated in [SS], is to just use NLO PDFs within the LO 
Monte Carlo. This choice at hadron colliders can be justified by arguing that the difference 
between LO and NLO PDFs is driven by the difference in DIS matrix elements used in the 
PDF determination, but the LO and NLO matrix elements for hadron collider processes 
are much closer to each other, so NLO PDFs with LO collider matrix element may provide 
a reasonable approximation to the exact NLO result. However, it turns out that this choice 
requires a substantial retuning of the parameters in event generators. 

An intermediate possibility consists of including some dominant NLO corrections to 
the LO matrix elements: for instance, in Ref. [13j it was pointed out that a sizable fraction 
of the large NLO and NNLO K factor for Drell-Yan comes from contributions which have 
the same kinematics as the LO, and thus can be simply absorbed in a rescaling of the 
LO cross-section. The MSTW08LO PDFs of Ref. [13] where determined by rescaling the 
Drell-Yan cross-section in this way. 

More general modifications of the standard LO were suggested in Ref. [S] and adopted 
in the construction of the MRST20071omod PDFs. These PDFs are based on the obser- 
vation that the LO fit quality mostly deteriorates because of the faster gluon evolution 
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NLO 


LO as =0.119 


LO* as =0.119 


LO as =0.130 


LO* as =0.130 


Total x' 


1.16 


1.74 


1.76 


1.68 


1.74 


(x^«) 


1.25 ±0.07 


1.95 ±0.21 


1.89 ±0.22 


1.95 ±0.19 


1.94 ±0.18 


NMC-pd 


0.97 


1.43 


1.13 


1.18 


1.12 


NMC 


1.72 


2.05 


1.68 


1.74 


1.72 


SLAC 


1.29 


3.77 


3.00 


2.91 


2.70 


BCDMS 


1.24 


1.87 


1.82 


1.76 


1.75 


HERAI-AV 


1.07 


1.70 


1.55 


1.58 


1.59 


CHORUS 


1.15 


1.51 


1.67 


1.53 


1.67 


NTVDMN 


0.45 


0.69 


0.71 


0.71 


0.78 


ZEUS-H2 


1.29 


1.51 


1.42 


1.43 


1.44 


ZEUSF2C 


0.78 


1.75 


1.26 


1.56 


1.34 


H1F2C 


1.51 


1.77 


2.00 


1.81 


2.02 


DYE605 


0.85 


1.86 


2.02 


1.70 


1.83 


DYE886 


1.26 


1.99 


2.52 


2.59 


3.11 


CDFWASY 


1.83 


1.80 


2.50 


2.16 


2.29 


CDFZRAP 


1.64 


2.88 


3.89 


2.08 


2.58 


DOZRAP 


0.59 


1.07 


1.29 


0.87 


1.02 


CDFR2KT 


0.96 


2.60 


3.22 


2.45 


2.76 


D0R2CON 


0.83 


1.18 


1.56 


1.17 


1.35 


[M] 


1 


1 


1.16 ±0.03 


1 


1.09 ±0.03 



Table 4: Fit quality for the global fit and for all experiments included in it for each of the 
NNPDF2.1 LO PDF sets. The corresponding values for the NNPDF2.1 NLO set of Ref. [SD] are 
given for comparison. The value of the momentum integral [M] Eq. ([3]) is also shown. All the fits 
have A'rcp — 100 replicas. 

at small x and the slower quark density evolution at large x. A possible way to improve 
this is to use the NLO value for the strong coupling constant together with its two-loop 
running. This leads to smaller values of a^ in the low Q^ region where the small x data 
are concentrated, and thus to slower PDF evolution. Of course, use of NLO Ug within an 
otherwise LO framework is a subleading and thus legitimate change. Another possibility is 
to relax the momentum sum rule. This alleviates another possible problem of the LO fit, 
namely the fact that faster small x gluon evolution leads through the momentum sum rule 
to depletion of the gluon content at medium/large x, which may cause a poor description 
of large x fixed-target data. Of course, a violation of the momentum sum rule is in princi- 
ple forbidden by first principles, however it can be justified as an ad hoc phenomenological 
patch. 

Finally, one may take the point of view [lOJ that the goal of LO PDFs is to be used 
with Monte Carlo generators, and thus they should be determined by optimizing the 
agreement with the data of the predictions obtained by using them in combination with 
a generator. This involves considering all the various modifications of the minimal LO 
framework discussed above, and also introducing suitable pseudodata to optimize the 
agreement with Monte Carlo generators. The CT09MC1/MC2 [10] PDFs were constructed 
in this way. 

5.2 Quality of the fit 

We have produced four NNPDF2.1 LO PDF determinations: with two different values 
of as (Mz), 0.119 and 0.130 (but always with LO running of as), and with and without 
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imposing the momentum sum rule. The various PDF sets, including the name of the 
corresponding LHAPDF grid files, are summarized in Table O 

The x^ of ths four LO NNPDF2.1 sets, both for the global fit and for individual 
experiments, are collected in Table HI and compared to the corresponding results of the 
NNPDF2.1 NLO set. The value of x^ corresponds to the central PDF set (replica zero), 
obtained as the average over replicas, while (x^ ^^') is the average over the replica sample of 
the x^ of each PDF replica. We refer to Sect. 4 of Ref. [8] for a more detailed discussion of 
the various statistical indicators: here it will suffice to say that all x^ values are computed 
including the full covariance matrix of each experiment, with normalization uncertainties 
included using the method of Ref. [60] . 

The fit quality is the same within uncertainties in all four cases: the values of {x^ ) 
differ from each other by less than a standard deviation. The fit with a larger value 
Us {Mz) = 0.130 seems to be slightly favored, but the difference in x^ as the value of Ug is 
varied is so small that we have not pursued further the option of also using NLO running 
of the strong coupling. The behaviour of the fit when the momentum sum rule (MSR) is 
not imposed is interesting: while the global fit quality is the same as in the fit with the 
MSR, the x^ of individual experiments changes significantly: the fit quality improves for 
some sets (like for example HERA), but relaxing the MSR leads to a worse description of 
the hadronic data. 

Note that we can fit simultaneously the Drell-Yan and deep-inelastic data without 
having to rescale the Drell-Yan data as discussed in Sect. 15.11 (unlike Ref. [13.J)- Of course, 
such a rescaling would likely lead to an improvement of the agreement of quality of the 
fit to Drell-Yan data. However, the ensuing PDFs would then be optimized for use in 
conjunction with codes (Monte Carlo or otherwise) in which similar corrections are also 
included. Optimizing LO PDFs in view of their use with some specific code such as a 
Monte Carlo event generator, as was done in Ref. [lOj, is an interesting task; however, we 
will not pursue it here, where we are rather mostly interested in constructing PDFs based 
on pure LO theory, with the clear limitations that this implies. 

In Table H] we also give the value of the momentum integral 



[M] = / dxxS(x,Q^)+ / dxxg{x,Q^) 
Jo Jo 



(3) 



for each of LO PDF sets. These are determined at the starting scale Qq = 2 GeV^, but 
note that the momentum integral [M] does not depend on scale. A discussion of the 
behaviour of the momentum integral at LO, NLO and NNLO will be given in Sect. 17.21 
below. 

In summary, while we do find a non-negligible deterioration in fit quality in comparison 
to the NLO fit, we do not find that this can be improved by either relaxing the momentum 
sum rule or changing the value of a^. Preliminary investigations using the NLO running 
of as also did not show significant improvements in fit quality. We did, however, find a 
significant improvement in fit quality if the positivity constraint on PDFs is removed: the 
X^ of the LO fit then becomes only about 10% higher than in the NLO case. The price to 
pay for this is that the gluon becomes rather negative at large x. However, negative LO 
PDFs are not acceptable, as they might lead to negative cross-sections; therefore we have 
not pursued this possibility further. 
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Figure 6: Distances between tlie reference LO and NLO NNPDF2.1 sets. Here and in subsequent 
figures in this section, the left plot shows the distance between central values, while the right plot 
shows the distance between the uncertainties. Both have as{Mz) = 0.119. 



5.3 Parton distributions 

We now compare the four LO PDF sets with each other, with NLO PDFs, and with other 
available LO PDF sets. We will compute the distance between central values and uncer- 
tainties of the various pairs of PDFs which are being compared, defined as in Appendix A 
of Ref. [6]. Recall that with A'^rep = 100 replicas a distance d = 1 corresponds to central 
values which differ by —/m'^: with a the sum in quadrature of the uncertainties of the two 
sets. If the sets which are being compared are statistically equivalent, then all distances 
are of order one, while if they are statistically inequivalent but consistent at the n sigma 
level, then distances are of order of d ~ 7n. 

We begin by comparing the NNPDF2.1 LO set with as=0.119, which we take as the 
LO reference, to the reference NNPDF2.1 NLO set. The corresponding distances are 
plotted in Fig. [6l while the singlet, valence and gluon PDFs are compared in Fig. [71 A 
full comparison of all LO, NLO and NNLO PDFs will be presented in Sect. 17.11 It is 
clear from Fig. [6] that LO and NLO PDF uncertainties, though clearly not statistically 
equivalent, are consistent at the one sigma level: this shows that these uncertainties 
essentially reflect the uncertainty of the underlying data, which are the same in the two 
PDF determinations. On the other hand, central values differ by many sigma: this means 
that, as already mentioned, the difference between LO and NLO PDFs is much larger 
than the uncertainty on either, and thus the dominant uncertainty on LO PDF is the 
theoretical uncertainty due to the lack of inclusion of higher order corrections. 

The largest shift from LO to NLO, more than five times larger than the PDF uncer- 
tainty, is observed for the gluon at medium-small x (10~^ 1^ x < 0.05), consistent with 
the fact that the gluon decouples from LO observables, but also the large x quark singlet 
and valence distributions change by more than three sigma. Generally, the LO gluon is 
larger than the NLO one. However, for x < 10~^, where there are no data to constrain the 
fits, the LO and NLO gluons become consistent within the large PDF uncertainties. At 
larger x, the LO and NLO gluons are quite similar and compatible within the respective 
uncertainties. The LO quark is rather smaller (by more than one sigma) than the NLO 
one for large x > 0.1, but it becomes compatible with it at the one sigma level for smaller 
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Figure 7: Comparison of the quark singlet, valence and gluon distributions for the pair of PDF 
fits whose distances are plotted in Fig. [Sj 



X. Finally, the light sea and strangeness asymmetries are minimally affected and quite 
close at LO and NLO. 

It is interesting to observe that the missing large NLO i^-factors in Drell-Yan data 
should enhance the LO quark distributions in comparison to the NLO ones: the fact that 
they end up being instead either smaller or comparable suggests that the Drell-Yan data 
actually have relatively little effect on the LO fit, other than through the determination 
of the u — d light flavor asymmetry. This is less sensitive to the K factors (being mostly 
determined by a cross-section ratio), and indeed turns out to be almost the same at LO 
and NLO. 

Next we compare the various LO PDF sets to each other. First we compare the two 
LO sets which differ in value of the strong coupling, as{Mz) = 0.119 vs. as{Mz) = 0.130. 
The larger value of the strong coupling, when evolved down to a scale Q^ ~ 10 GeV"^ 
using LO evolution, leads to a value of a^ close to that preferred by data in this region. 
Hence, the larger value leads to a better description of scaling violations at low scale, and 
conversely, as it is apparent from Table [H 

The distances for central values and uncertainties between the LO fits with different 
as are plotted in Fig. [HI The only PDF which is significantly affected by the value of a^ 
is the gluon, which, as shown in Fig. [9l becomes smaller at medium-small x and thus, by 
the momentum sum rule, somewhat larger at large x when ag is increased. This makes 
the LO gluon with larger as closer to the NLO gluon. However, the shift as as is varied 
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Figure 9: Comparison of the quark singlet, and gluon distributions for the pair of PDF fits whose 
distances are plotted in Fig. [5] 



in this range is comparable to the PDF uncertainty. Also, the large x singlet and valence 
quark PDFs increase somewhat when Og is raised, especially at large x ~ 0.3, where a 
shift of about two sigma is observed. 

The effect of relaxing the momentum sum rule is studied by comparing the LO and LO* 
sets. Those with as{Mz) = 0.119 are compared in Fig. [TUl where the distance between 
them is displayed. The main difference is seen in the medium x gluon, as shown in Fig, lilt 
the LO* gluon is rather larger than the LO one. However, the central values for all quark 
PDFs are very close to the standard LO ones. 

In conclusion, we compare the NNPDF2.1 LO PDFs to other available LO sets. First, 
we compare the NNPDF2.1 LO set with a.^ = 0.130 to MSTW08 LO [T3] {a^ = 0.139) 
and CTEQ6L1 [61j (a^ = 0.130) in Fig. [121 Differences are especially large for the gluon 
distribution, both at small and large x, and for the isospin triplet distribution and large x, 
though differences between the NNPDF and MSTW sets are mostly compatible with the 
large uncertainties, while the difference between CTEQ and other sets is more difficult to 
quantify more precisely because CTEQ LO PDFs come without an uncertainty estimate. 

Finally we compare with the modified LO PDF sets MRST20071omod [9j , and with the 
dedicated Monte Carlo sets of the CTEQ/TEA collaboration [10], CT09MC1, CT09MC2 
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Figure 10: Distances between the NNPDF2.1 LO and LO* sets with a,,=0.119. 
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Figure 11: Comparison of the valence and gluon distributions for the pair of PDF fits whose 
distances are plotted in Fig. (TUl 



and CT09MCS. The MRST20071omod set is obtained relaxing the momentum sum rule 
and using two-loop running of a^; with as{Mz^ = 0.121. The CT sets are based on an LO 
QCD analysis framework of data which, on top of the standard global dataset used for the 
NLO PDF determination, also includes a set of LHC pseudo-data generated using NLO 
PDFs. The normalization of the LO calculation for each pseudo-data set is allowed to float 
to reach the best agreement with the NLO cross-section. The CT09MCS is extracted from 
an analysis in which the two-loop strong coupling is used and the momentum sum rule is 
imposed during the fit. The CT09MC1 and CT09MC2 are fits in which the momentum 
sum rule is relaxed and one- and two-loop expressions for Og are used respectively. All 
these sets are compared to the default NNPDF2.1 LO set in Fig. 1131 Differences between 
these modified PDF sets are significant, and typically larger than the difference between 
the NNPDF2.1 LO and LO* sets, consistent with the fact that they are based on different 
methodologies and assumptions. 
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Figure 12: Comparison of LO PDFs: tlie quark singlet, triplet and gluon PDFs are shown for the 
NNPDF2.1, MSTW08 and CTEQ6L1 sets. 
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Figure 13: Comparison of the NNPDF2.1 LO PDF to modified LO PDF sets: MRST20071omod, 
CT09MC1, CT09MC2 and CT09MCS. 
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Xtot 



{E) ± aE 

{Etr) ± aE,, 

(-^val) ± (TE^,i 

(TL) ± CTTL 



X 



alfcT^ 



^^y^ 



P^)dat^ 






1.16 

2.22 ±0.07 
2.19 ±0.09 
2.27 ±0.10 
(17 ±7) 10^ 



1.23 ±0.05 



11.9 
3.2 



0.18 
0.53 



Table 5: Table of statistical estimators for the NNPDF2.1 NNLO fit with N^cp = 1000 rephcas. 
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Figure 14: Distribution of x^^*^^ (left) and E^^ ' (right), over the sample of N,ep — 1000 rephcas 



6 Next-to-next-to-leading order parton distributions 

Next-to-next-to-leading order PDFs are mostly of interest for their use in the computation 
of standard candle processes such as W, Z, top and Higgs production at hadron colliders. 
In this section we will discuss the statistical features of the NNLO fit, then present the 
NNPDF2.1 NNLO PDFs and compare them to other available NNLO sets. The imphca- 
tions of the NNPDF2.1 NNLO set for LHC observables are discussed in Sect. [HI 

6.1 Statistical features 

Statistical estimators for the NNPDF2.1 NNLO fit are shown in Table [5] for the global 
fit and in Table [6] for individual experiments, with, in the latter case, the NLO x^ values 
also shown for comparison. While referring to Refs. [51[51l60t[52] for a detailed discussion 
of statistical indicators and their meaning, we recall that Xtot i^ computed comparing the 
central (average) NNPDF2.1 fit to the original experimental data, (x ) is computed 
comparing to the data each NNPDF2.1 replica and averaging over replicas, while {E) 
is the quantity which is minimized, i.e. it coincides with the x^ computed comparing 
each NNPDF2.1 replica to the data replica it is fitted to, with the three values given 
corresponding to the total, training, and validation datasets. 
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0.62 



Table 6: Same as Table [S] for individual experiments. All estimators have been obtained with 
Ai^rep = 1000 replicas. Note that experimental uncertainties are always given in percentage. For 
reference we also provide the NNPDF2.1 NLO x^ for the various experiments. 
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Figure 15: Distribution of training lengths over the sample of N^^p = 1000 replicas. 



All statistical indicators (including the training length), and in particular the quality 
of the global fit as measured by the value Xtot ~ ^-^^ are quite similar to those of the 
NLO fit. Specifically, the NLO and NNLO y^ differ by less than 10% for all experiments, 
except SLAG, the W asymmetry and CDF jet data (for which NNLO is better) and the Z 
rapidity distribution (for which it is worse). It is interesting to observe that an excellent 
description of the HERA F2 data is obtained without the need of any ad hoc cut or tuning 
of the treatment of heavy quarks (the NNLO x^ is somewhat worse than the NLO one. 
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Figure 16: Comparison of NNPDF2.1 NLO and NNLO singlet sector PDFs, computed using 
.^rcp = 1000 replicas from both sets. All error bands shown correspond to one sigma. 



but at NNLO the dataset is considerably wider, as discussed in Sect. 12. 2|) . 

The distribution of x^^^-* , -E'tV ' ^^^ training lengths among the A^rep = 1000 NNPDF2.1 
NNLO replicas are shown in Fig. [T3] and Fig. [15] respectively. While most of the replicas 
fulfill the stopping criterion, a fraction (~ 20%) of them stops at the maximum training 
length iV™^ which has been introduced in order to avoid unacceptably long fits. This 
fraction is comparable but somewhat larger than the corresponding NLO one. In order to 
check that this causes no significant loss of accuracy, we have verified that if all replicas 
that do not stop dynamically are discarded, the PDF change by an amount which is 
smaller than a statistical fluctuation. We have also verified that this fraction is reduced 
if the maximum training length is raised, thereby showing that the issue is merely one of 
computational efficiency, rather than principle. 



6.2 Parton distributions 

The NNPDF2.1 NNLO parton distributions are shown along with their NLO counterparts 
in Figs. [16] and [T7] at the input scale Qq = 2 GeV^, in the basis in which they are 
parametrized. The distances (defined as in Appendix A of Ref. [6]) between the NLO and 
NNLO sets are shown in Fig. [THl 

Recalling that a distance d ~ 1 corresponds to statistical equivalence, while (with 100 
replicas) d ~ 7 is a one sigma shift, it is apparent that the NLO and NNLO sets are 
statistically inequivalent, but differ by typically less than one sigma. This in particular 
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Figure 17: Same as Fig. [11] for the non-singlet sector PDFs. 

means that PDFs in the NNPDF2.1 set are quite stable when going from NLO to NNLO. 
The largest variations are observed for quarks at x ~ 0.1, while the small x PDFs (gluon 
and light quark sea) are very similar to their NLO counterparts. It is worth noting that the 
distances for the PDF uncertainties in Fig. [18] are particularly small. This is as it should 
be, consistently with the fact that the quality of the NLO and NNLO fits are similar, given 
that theory uncertainties are not included in PDF uncertainties. 

In order to assess the impact of NNLO corrections on physical observables it is useful 
to compare NNLO and NLO PDFs for individual flavours at a typical hard scale. This 
is done in Fig. [T9l where the NNLO/NLO ratio is shown as a function of x at Q'^ = 10^ 
GeV^. The most noticeable changes are larger small x quarks (and correspondingly, due 
to evolution, larger small x gluons) and smaller large x quarks. The biggest differences 
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Figure 18: Distances between tlie NNPDF2.1 NLO and NNLO parton sets sliown in Figs. [H] 
and [171 All distances are computed from sets of N^^p — 100 replicas. 
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Figure 19: Comparison between NNPDF2.1 NLO and NNLO light quark and gluon PDFs at 
Q^ = 10'' GeV^. The results have been obtained with A^rcp = 1000 replicas. All curves are shown 
as ratios to the central NNPDF2.1 NNLO result. 



are observed for the light quark sea at x ~ 10 ^, where the NNLO and NLO bands just 
about miss each other. 

Next, in Figs. [20] and ED we compare the NNPDF2.1 NNLO PDFs to those from 
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Figure 20: The NNPDF2.1 NNLO singlet sector PDFs, compared to MSTW08 PDFs. The residts 
for NNPDF2.1 NNLO have been obtained with N^-^p — 1000 rephcas. All PDF errors are given as 
one sigma uncertainties. In the comparison a common value of as {Mz)=0.ll9 has been used. 



the MSTW08 NNLO set. For consistency we use in the comparison a common value 
of Us {Mz)=0.119. The MSTW08 NNLO gluon, unlike its NNPDF2.1 counterpart, is 
unstable at small x, where it becomes very negative. For other PDFs there is reasonable 
agreement for central values, although the uncertainty bands from MSTW often seem 
unusually small. Sizable differences are observed in the strange distribution, but it should 
be recalled that in MSTW08 the parametrization of the s + s and especially s — s PDF 
is extremely restrictive, while in NNPDF2.1 they are treated on the same footing as the 
other PDFs. 

At present, MSTW08 is the only NNLO PDF set which is publicly available through 
the LHAPDF |63ll64j interface for a variety of values of ag. However, it may also be 
interesting to compare the NNPDF2.1 NNLO PDFs to the ABKM09 NNLO set (with 
fixed flavour number nj = 3) [65]. This set is only provided for as {Mz) = 0.1135 it 0.0014, 
furthermore for this set (and its NLO counterpart) only combined PDF+a^ uncertainties 
can be determined, unlike other sets for which PDF uncertainties with fixed Ug may also 
be computed. The comparison is shown in Figs. [22] and [23] at Qg = 2 GeV^, where we 
have chosen the NNPDF2.1 set with Ug = 0.114 in order to make the comparison more 
significant. Even so, the agreement is generally not very good. 
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Figure 21: Same as Fig. [^Hlfor the non-singlet sector PDFs. 
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Figure 22: The NNPDF2.1 NNLO singlet sector PDFs, compared to the ABKM09 three-flavor set. 
The resuhs for NNPDF2.1 NNLO have been obtained with iV,.ep = 100 rephcas. The NNPDF2.1 
set with as — 0.114 is shown because ABKM PDFs are only available for this value of a^. Note 
that for ABKM uncertainties also include the uncertainty on as while for NNPDF they are pure 
PDF uncertainties. 
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Figure 23: Same as Fig. [33] for the non-singlet sector PDFs. 
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Figure 24: Comparison of NNPDF2.1 LO, NLO and NNLO singlet sector PDFs at Ql=2 GeV^. 
All uncertainty bands are defined as 68% confidence levels. 



7 Perturbative stability 

With PDF sets at LO, NLO and NNLO determined from the same data and using a 
uniform methodology we can address issues of perturbative stability. We will do this 
first by comparing individual PDFs, and then by looking at the behaviour of the total 
momentum fraction carried by partons. 



7.1 Parton distributions 

We assess the perturbative stability of the PDF determination by comparing the NNPDF2.1 
PDFs as they go from LO to NNLO accuracy in perturbative QCD. The LO, NLO and 
NNLO NNPDF2.1 PDFs are compared in Figs. [2l and [25] at the starting scale Ql=2 GeV^ 
in the basis in which they are independently parametrized by neural networks. All error 
bands shown are defined as 68% confidence levels, rather than as standard deviations, so 
that possible deviations from gaussian behaviour are accounted for. In Figs. [2U] and [271 we 
provide a similar comparison but this time at the scale Q^= (100 GeV)^ in the basis of 
individual flavours. 

The excellent convergence of the perturbative expansion within the kinematic region 
covered by the experimental data is clear from these plots. In particular, even in the 
small X and large x region, where we expect perturbation theory to become unstable and 
resummation to be necessary |66[l67j. no evidence of instability is seen in the PDFs, thus 
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suggesting that resummation corrections are smaller than current PDF uncertainties (at 
small X, this is borne out by the dedicated study of Refs. [68!,'69]). 

It is also clear that the NNLO and NLO results for all PDFs almost always agree within 
uncertainties. In particular, with one single exception, at the starting scale (Figs. [2Hl25|) 
the NNLO central value is within (or just outside) the NLO uncertainty band, and in fact 
it differs from the NLO central value by an amount which is usually much smaller than 
the NLO uncertainty. The exception is the isospin triplet distribution around the valence 
peak X ~ 0.3, where the NLO and NNLO bands overlap, but the NNLO central value 
is clearly outside the NLO band. At a higher scale (Figs. [26ll27p the situation further 
improves, and the NLO and NNLO results become almost indistinguishable, and only the 
small discrepancy in light quark distributions for x > 10~^ already observed in Fig. \T9\ 
remains. 

This leads to an important conclusion. At present, the PDF uncertainties provided by 
NNPDF, and indeed all other PDF groups, only reflect the data uncertainties: in particular 
they do not include the theoretical uncertainty due to higher perturbative orders, which 
could be estimated by varying the renormalization and factorization scale during the PDF 
fit. At NLO we can estimate the theoretical uncertainty by a direct comparison with 
the NNLO results. This comparison shows that at NLO (and beyond) it is at present 
generally a reasonable approximation to neglect the theoretical uncertainty, since it is 
usually smaller than the PDF uncertainty coming from uncertainties in the data. 

On the other hand, at the starting scale the LO PDFs differ by many standard devi- 
ations from NLO PDFs. One must conclude that at LO the PDF uncertainty provided 
with NNPDF PDFs (as well as with any other available LO set) is only a fraction of the 
total uncertainty, the theoretical component here being the dominant one. The situation 
improves somewhat at high scale (Figs. [26ll27p . but the difference between LO and NLO 
remains large for the gluon. 

Finally, all seven independently parametrized LO, NLO and NNLO PDFs are collected 
in a single plot in Fig. [28] at a low scale Q^=2 GeV^ and in Fig. [29] at higher scale 
Q^ = 10^ GeV^. These plots illustrate the relative size of individual PDFs. Note that 
at high scale the plot has a log scale on the vertical axis because due to perturbative 
evolution the different PDFs can differ by several orders of magnitude. 
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Figure 25: Same as Fig. [M] for the non-singlet sector PDFs. 
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Figure 26: Comparison of NNPDF2.1 LO, NLO and NNLO PDFs in the flavour basis at Q^ 
fO'' GcV^: light quarks and gluon. 
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Figure 27: Comparison of NNPDF2.1 LO, NLO and NNLO PDFs in the flavour basis at Q^ 
10'' GeV^: strange and heavy quarks. 
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Figure 28: Summary of the NNPDF2.1 LO (top), NLO (center) and NNLO (bottom) PDF sets 
at (3^=2 GeV^. All uncertainty bands are defined as 68% confidence levels. 
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Figure 29: Same as FigUHlat Q^ = 10^ GeV'^. Note the logarithmic scale on the y axis. 
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Figure 30: Distribution of total momentum integrals for the A^rcp = 100 replicas in the NNPDF2.1 
LO*, NLO* and NNLO* PDF sets; ^..(M^) =0.119 in aU cases. 



7.2 The momentum of quarks and gluons in the nucleon 

The value of the total momentum carried by quarks and gluons and its dependence on the 
perturbative order provide a strong consistency check of the perturbative QCD framework. 
With the aim of testing this, we have performed NLO and NNLO PDF determinations in 
which the momentum sum rule is relaxed, denoted as NLO* and NNLO*, which supple- 
ment the LO* fit of Sect. [5j We take as{Mz) =0.119 at all perturbative orders. In all 
cases, we find that the fit quality is not changed in a significant way when relaxing the 
momentum sum rule. 

The momentum fraction carried by a parton distribution is 



[q]{Q^)= I dxxq{x,Q^) 
Jo 



(4) 



Using the LO*, NLO* and NNLO* PDF sets, we find that the total momentum [M] 
[S] + [g] carried by partons is 

[M]lo = 1.161 ±0.032, 
[M]nlo = 1-011 ±0.018, 



(5) 



[M] 



NNLO 



1.002 ±0.014. 



where the uncertainty is only from PDFs (and thus does not include any theoretical 
uncertainty). The distributions of total momentum integrals over the 100 replicas for the 
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PDF combination 


LO* 


NLO* 


NNLO* 


P + 5] 


1.161 ±0.032 


1.011 ±0.018 


1.002 ±0.014 


Ql = 2 GeV" 


n (Q^) 


0.550 ±0.025 


0.591 ±0.010 


0.602 ±0.010 


[g] m 


0.612 ±0.028 


0.421 ±0.021 


0.400 ±0.018 


[{u + u)]{Ql) 


0.346 ±0.015 


0.371 ±0.005 


0.376 ± 0.005 


\d + d)- {QD 


0.192 ±0.011 


0.206 ± 0.005 


0.209 ± 0.003 


\{s + s)]{Ql) 


0.012 ± 0.004 


0.014 ±0.006 


0.017 ±0.006 




Q' = 10^ 


GeV^ 




n iQ'') 


0.546 ±0.017 


0.528 ±0.009 


0.527 ±0.005 


[g] m 


0.615 ±0.020 


0.486 ±0.018 


0.475 ±0.011 


[{u + u)]{Q^) 


0.264 ± 0.009 


0.256 ±0.003 


0.255 ±0.002 


\d+S)' (Q2) 


0.165 ±0.007 


0.160 ±0.002 


0.159 ±0.001 


[{s + s)]{Q^) 


0.048 ± 0.003 


0.047 ± 0.004 


0.048 ± 0.004 


[(c + c)](Q2) 


0.041 ±0.002 


0.039 ± 0.002 


0.039 ±0.001 


[h + h) (Q2) 


0.027 ±0.001 


0.025 ±0.001 


0.025 ±0.001 



Table 7: Momentum fractions of various PDF combinations at low scale Qg 
scale Q^ = W^ GeV^ when the momentum sum rule is not imposed (LO*, 
PDF sets). All results are obtained with A^rep = 100 replicas. 



= 2 GeV^ and high 
NLO* and NNLO* 



NNPDF2.1 LO*, NLO* and NNLO* sets is shown in Fig.[30l they appear to be Gaussian 
to a good approximation. 

Estimating the theoretical uncertainty as the difference between results at two subse- 
quent perturbative orders, we see that at LO the theoretical uncertainty is dominant, as 
we already concluded from the PDF plots Figs. [2HI25] in Sect. [71 The deviation of the 
LO momentum integral from the QCD prediction is mostly driven by the gluon, which 
turns out to be larger in the LO* set than in the default LO set with momentum sum 
rule imposed. On the other hand already at NLO the theoretical uncertainty is half of 
the PDF uncertainty, A* [-^]nlo ~ 0.01, and thus at NNLO the theoretical uncertainty 
is likely to be negligible. 

It is also interesting to determine the momentum fraction carried by individual PDFs. 
These are tabulated in Tables [7][8] at a low scale Qq = 2 GeV^ and at a high scale Q^ = 
10^ GeV2, both before (Table H * PDF sets) and after (Table El standard PDF sets) 
imposing the momentum sum rule. They are also plotted as a function of scale in Fig. [311 
We show the momentum fractions of the light quarks, the gluon, and the total quark 
singlet combination. 

The asymptotic values of the momentum carried by the total quark and gluon distri- 
butions are predicted in perturbative QCD to be 



lim 



mQ' 



3n 



/ 



16 + 3n/ 



0.5294; 



lim [g]iQ') 



16 



16 ± 3n/ 



0.4706 (6) 



(see e.g. Ref. |7D]). The results of Tables [351 are in impressive agreement with the QCD 
prediction Eq. ^. When the momentum sum rule is imposed (Table [51) the accuracy of the 
determination of each momentum component improves, and the agreement with the QCD 
prediction Eq. ^ improves accordingly. In Fig. [32lwe compare for the NNPDF2.1 NNLO 
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PDF combination 


LO 


NLO 


NNLO 


P + 5] 


1 


1 


1 


Ql = 2 GeV" 


s {QD 


0.521 ±0.023 


0.590 ± 0.009 


0.609 ±0.013 


[g] iQl) 


0.479 ± 0.022 


0.411 ±0.009 


0.391 ±0.012 


[{n + u)]{Ql) 


0.328 ±0.012 


0.371 ± 0.005 


0.381 ± 0.007 


{d + d} (Ql) 


0.181 ±0.010 


0.206 ± 0.004 


0.211 ±0.005 


[{s + s)]{Ql) 


0.012 ±0.005 


0.013 ± 0.006 


0.017 ±0.005 




Q" = 1C 


)4 GeV^ 




P] (Q^) 


0.492 ±0.010 


0.523 ± 0.003 


0.529 ± 0.004 


[g] m 


0.509 ±0.010 


0.477 ± 0.003 


0.471 ± 0.005 


[{n + u)]{Q^) 


0.245 ± 0.007 


0.255 ± 0.003 


0.257 ±0.004 


'{d + d)' {Q^) 


0.150 ± 0.006 


0.159 ±0.002 


0.159 ± 0.002 


[{s + s)]{Q') 


0.041 ± 0.003 


0.046 ± 0.003 


0.048 ± 0.002 


[(c + c)](g2) 


0.033 ± 0.001 


0.0383 ± 0.0004 


0.0393 ± 0.0006 


'{b + b)' (Q^) 


0.021 ±0.001 


0.0245 ± 0.0002 


0.0249 ± 0.0003 



Table 8: Same as Table [71 but when the momentum sum rule is imposed (LO, NLO and NNLO 
PDF sets). 

and NNLO* fits the gluon and singlet momentum fraction and their ratio with the corre- 
sponding asymptotic values predicted by pQCD. This confirms the excellent agreement, 
both with and without the momentum sum rule imposed. 

It is interesting to observe that even though the limits Eq. ^ hold at any perturbative 
order, because of asymptotic freedom, the results of Tables 018] agree better with the limits 
as the perturbative order increases. This is due to the fact that the uncertainties given in 
the tables do not include the theoretical uncertainty, which decreases as the perturbative 
order increase. Indeed, comparison of central values of the various momentum fractions 
at subsequent perturbative orders shows that at NLO this uncertainty is small, but not 
entirely negligible, as already concluded in Sect. 17.11 while at NNLO it is likely much less 
than the PDF uncertainties. 
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Figure 31: Momentum fractions carried by various combinations of quark and gluon distributions 
as a function of the scale Q^ at NNLO, when the momentum sum rule is (top) or is not (bottom) 
imposed. 



44 



Momentum fractions in NNPDF2.1 NNLO 
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Figure 32: Momentum fractions carried by quarks and gluons, and their ratio, as a function of the 
scale Q^ at NNLO, when the momentum sum rule is (top) or is not (bottom) imposed, compared 
to the asymptotic pQCD predictions, Eq. (jH]). 
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8 Phenomenological implications 

We will now present the parton luminosities determined from NNPDF2.1 NNLO PDFs, 
and use them to compute several LHC standard cross-sections. As we shall see, the first 
LHC data already have discriminating power between existing PDF sets. They are thus 
likely to lead to a substantial improvement in PDF accuracy in the near future. 

8.1 Parton luminosities 

At a hadron collider, all factorized observables depend on parton distributions through a 
parton luminosity, which, following Ref. [59], we define as 

$., (Mi) = - [' ^n (xi, A4) /, (r/:ri. Mi) , (7) 

where fi{x,M'^) is a PDF and r = M'j^/s. Therefore, a good deal of information on the 
dependence of hadron-level cross-sections on PDFs can be gathered by simply looking at 
the luminosities which correspond to individual parton subprocesses. We consider in par- 
ticular the gluon-gluon luminosity, the total quark-gluon and quark-antiquark luminosities 
defined as 

j=l j=l 

and the charm and beauty quark-antiquark luminosities. 

The luminosities computed from the NLO and NNLO NNPDF2.1 PDF sets are com- 
pared in Fig. [551 all normalized to the NNPDF2.1 NNLO central value. The compatibility, 
and thus the perturbative stability, is good for of all luminosities, as expected from the 
PDF comparison of Figs. [TBlll8[ In particular, the gluon-gluon luminosity, which is rel- 
evant for Higgs production at the LHC, is quite stable in the region which corresponds 
to standard Higgs production, though for larger invariant masses the NNLO luminosity 
becomes smaller. Non-negligible differences are seen for the quark-antiquark luminosity, 
which is significantly larger at NNLO in the region relevant for W and Z production, as 
a consequence of the discrepancy in light quark distributions already noted in Fig. [T9l 
which gets squared at the level of luminosities. Similar but somewhat smaller differences 
are seen in the quark-gluon channel. The heavy quark PDFs follow the behaviour of the 
gluon, from which they are generated dynamically via perturbative evolution. Note that 
the masses of the heavy quarks rric and ttt-^ are the same in the NLO and NNLO analyses. 

In Fig. [Ml the NNPDF2.1 NNLO luminosities are compared to MSTW08 NNLO (at 
a common value of a^ = 0.119), all plotted as ratios to the NNPDF2.1 NNLO central 
value. The agreement is generally quite good in the region of M\I s which corresponds 
to typical electroweak final state masses Mx at the LHC, but it deteriorates for very low 
and especially very high Mx- This is mostly a consequence of the strikingly different 
behaviour of the singlet and especially the gluon distribution at small x seen in Fig. [20l 
related to the unstable behaviour of the MSTW08 NNLO gluon. 

8.2 Predictions for LHC observables 

NNLO computations are at the current frontier of perturbative QCD, and are thus avail- 
able for only a small number of processes. We will now present NNLO results for the 
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Figure 33: Comparison of the parton luminosities Eqs. (I7][H1) for LHC at 7 TeV, computed using 
the NNPDF2.1 NLO and NNLO PDFs, using N.^p = 100 repHcas from both sets. From left to 
right we show $gg, ^qg, (top) $,,, ^cc, (middle) $^^, $(,g (bottom). All luminosities are plotted 
as ratios to the NNPDF2.1 NNLO central value. All uncertainties shown are one sigma. 

total cross-section for Higgs and weak vector boson production. We will consider also the 
approximate NNLO computation of the ti production cross-section. 

When comparing predictions for physical observables with the aim of understanding 
their dependence on PDFs and their associated uncertainty it is important not to mix the 
uncertainties due to PDFs with uncertainties due to the choice of external parameters [71] . 
In particular, collider observables have a nontrivial dependence on the value of the strong 
coupling, both through the hard matrix elements and due to the correlation of PDFs with 
the value of Og |72fl74j . which is particularly significant for the gluon distribution. As a 
consequence, as already discussed in Sect.[6l the only NNLO PDF set with which a detailed 



47 



PDF Uncertainty parton lumia - LHC 7 TeV - Ratio to NNPDF2.1 NNLO 



PDF Uncertainty parton lumis - LHC 7 TeV - Ratio to NNPDF2.1 NHLO | 




■'°" (WF/Sy^ 



PDF Uncertainty parton iumis - LHC 7 TeV - Ratio to NNPDF2.1 NNLO 



( M^ / s y^ 




{ WP / s f- 

PDF Uncertainty parton lumis - LHC 7 TeV - Ratio to NNPDF2.1 NNLO | 





( ruf / s f^ 




PDF Uncertainty parton lumis - LHC 7 TeV - Ratio to NNPDF2.1 NNLO | 



■'"" {WIS)'" 



Figure 34: Same as Fig. [33 but for NNPDF2 
sets PDFs corresponding to the same value as 




( I* / S )™ 



.1 NNLO and MSTW2008 NNLO PDFs. For both 
= 0.119 have been used for consistency. 



quantitative comparison is currently possible is MSTW08. For the sake of illustration, 
however, we will also present comparisons with the ABKM09 NNLO set, even though it 
should be kept in mind that these PDFs are provided for as{Mz) = 0.1135 it 0.0014, and 
their uncertainties always include also the uncertainty due to the variation of a^ in this 
range. 

We consider first the total inclusive cross-section for Higgs production from gluon-gluon 
fusion, whose dependence on PDFs has attracted considerable attention recently, in view 
of claims in the literature (see Ref. [75] and references therein) that the recommended [76] 
determination of PDF uncertainties through the so-called PDF4LHC prescription [77] 
might be substantially underestimated (see Refs. [l5] and especially Ref. ^49] for a thorough 
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Figure 35: The total cross-section for Higgs boson production via gluon fusion at NNLO as a 
function of m^. Results are shown for NNPDF2.1 with a^iMz) = 0.119 and as{Mz) = 0.117, 
MSTW08 with UsiMz) = 0.1171, and ABKM09 with as{Mz) = 0.1135 ± 0.0014, all displayed as 
ratios to the central NNPDF2.1 curve. The NNPDF result is obtained using A^rcp = 100 replicas. 
All uncertainties shown are one sigma; for NNPDF and MSTW they are pure PDF uncertainties, 
while for ABKM they also include the as uncertainty corresponding to their given range. 

discussion of this issue). We have computed the NNLO Higgs production cross-section in 
the gluon fusion channel using the code of Refs. [78y79j. Results are shown in Fig. [35] as a 
function of the Higgs mass mn, determined using the NNLO NNPDF2.1, MSTW08 and 
ABKM09 PDF sets. The predictions for each group are shown at the respective default 
value of a,, namely asiMz) = 0.119 for NNPDF, as{Mz) = 0.1171 for MSTW08 and 
as{Mz) = 0.1135 ± 0.0014 for ABKM09; for NNPDF the prediction for as{Mz) = 0.117 
is also shown in order to allow for a direct comparison with MSTW08 (NNPDF2.1 NNLO 
predictions for more values of Og are shown and discussed in Fig. [39] of Sect. 19.11 below). 
All uncertainties shown are one sigma PDF uncertainties only (but in the case of ABKM, 
as mentioned, they include also the uncertainty on a^). The NNPDF and MSTW results 
are in excellent agreement, provided the same value of a^ is used. On the other hand, the 
ABKM result disagrees by many standard deviations: even though a sizable fraction of 
this disagreement is due to the different value of a^, it would persist even when the same 
value of Us is adopted (see Fig. [39] below). 

Another LHC process which is very sensitive to QCD dynamics is top production. Full 
NNLO corrections for this process are not available, however, approximate NNLO expres- 
sions based on threshold resummation have been constructed [82] and implemented in the 
public HATHOR code [83] . In Fig. [36] we show the prediction for the total tt cross-section 
determined at NLO and NNLO at the LHC 7 TeV with mt =172 GeV (pole mass). Theo- 
retical predictions are shown for NNPDF2.1 at NLO and NNLO using the corresponding 
parton sets with as{Mz) = 0.119; they are compared to the NLO and NNLO MSTW08 
predictions which have as{Mz) = 0.1202 at NLO and as{Mz) = 0.1171 at NNLO, and to 
the ABKM NNLO prediction which has as{Mz) = 0.1135 ±0.0014, by also displaying the 
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Figure 36: The total cross-section for tt production at the LHC 7 TeV computed using HATHOR 
(left) and its ratio to the total Z production cross-section computed using VRAP (right). Results 
are shown for NNPDF2.1 with asiMz) = 0.119 (NLO and NNLO), as{Mz) = 0.120 (NLO) and 
asiMz) = 0.114,0.117 (NNLO), MSTW08 with a,(A/z) = 0.1202 (NLO) and a^iMz) = 0.1171 
(NNLO), and ABKM09 with asiMz) = 0.1135±0.0014 (NNLO). The NNPDF results are obtained 
using A^rcp — 100 replicas. All uncertainties shown are one sigma; for NNPDF and MSTW they 
are pure PDF uncertainties, while for ABKM they also include the as uncertainty corresponding 
to the given range. The band corresponds to the combination of the most recent CMS and ATLAS 
measurements (left, see text) and to the CMS measurement [HI] (right). 



NNPDF results with as{Mz) = 0.120 (NLO) and as{Mz) = 0.114,0.117 (NNLO). These 
theoretical predictions can be compared to the average of the recent measurements from 
CMS imiBl], a{ti) = 158 ± 19 pb, and ATLAS [85], a (ti) = 180 ± 19 pb. Averaging 
the most accurate results, which have been obtained with a luminosity of ~ 36 pb~^, and 
assuming that the two measurements are independent, yields a (tt) = 169 it 13 pb (shown 
in Fig. [36] as a dashed band) . 

Figure (361 shows that the NNPDF2.1 and MSTW08 predictions are in good agreement 
both at NLO and NNLO: however, once again, it is important that a common value of as 
be used. Also, of course, one should remember that the uncertainties shown in Fig. 1361 are 
only PDF uncertainties. In particular theoretical uncertainties, such as may be estimated 
by scale variation, and uncertainties due to the dependence on the top mass, are not shown 
and might also be significant. Again, the ABKM09 prediction is significantly lower, and 
the disagreement persists even when a common value of Os is adopted: already the LHC 
data are starting to discriminate between PDF sets. 

A closely related recent measurement by CMS [81] is the ratio of ti and Z cross- 
sections. We have computed predictions for this observable using the VRAP code [53| 
together with HATHOR, for the same PDF sets and settings. At NNLO this ratio is only 
weakly dependent on the value of a^. Results are also shown in Fig. [36] and compared to 
the CMS measurement, again shown as a dashed band. The conclusions are similar. 

Finally, we consider the total cross-sections for electroweak gauge boson production 
and their ratios at the LHC. We have computed these observables with the VRAP code [53], 
within the narrow-width approximation (including the 7* contribution to gauge boson 
production). Results are collected in Fig. [37] As for the case of top production, we 
show NNPDF2.1 at NLO and NNLO, using our preferred value a^ = 0.119, and also 
compare NNPDF2.1 with MSTW08 at NLO and NNLO, and ABKM09 at NNLO, using 
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ATLAS 


CMS 


Average 


a{W+)B{l+iy) (nb) 


6.26 ±0.32 


6.02 ±0.26 


6.11 ±0.20 


a{W-)B{l-iy) (nb) 


4.15 ±0.21 


4.26 ±0.19 


4.21 ±0.14 


(7{Z")B{l+l-) (nb) 


0.945 ±0.051 


0.975 ±0.044 


0.962 ±0.033 


cr {W+ + W-) B {lu) /a (Z") B {l+r) 


10.91 ±0.28 


10.54 ±0.19 


10.65 ±0.16 


(t{W+)/cf{W-) 


- 


1.421 ±0.034 


1.421 ±0.034 



Table 9: Recent results from CMS at ATLAS for the total cross-sections for W^ , W" and Z° 
production and their ratios, obtained with an integrated luminosity of ~ 36 pb~^, together with 
their average. The average has been obtained assuming the two measurements to be completely 
uncorrelated. 



their preferred values of ag- Again, we have averaged the most updated CMS [SB] and 
ATLAS [57] results for these observables, assuming they are uncorrelated. The individual 
ATLAS and CMS results, corresponding to an integrated luminosity of 36 pb~^, together 
with their average are summarized in Table [9j The results are shown as dashed bands on 
the plots. Note that for cross-section ratios the uncertainty shown is computed as a 68% 
confidence level, because we have verified that the distribution of results can be markedly 
non-gaussian. 

The dependence on q^ is weaker for these processes, which are quark-dominated, inde- 
pendent of the strong coupling at Born level, and affected by smaller NNLO corrections. 
However, the dependence of the predictions on the perturbative order is still not neg- 
ligible. Differences between PDF sets are also less significant, except for the W^ /W~ 
cross-section ratio which is a very sensitive probe of the quark flavor decomposition. The 
general conclusions are similar to those from top production: the LHC data (in particular 
the W cross-section ratio) already show some discrimination between PDFs. Moreover, 
for some of these standard candle observables the LHC data may be able to discriminate 
between NLO and NNLO. 

As a general conclusion of this first pehnomenological study, we note that the pattern 
of comparison between PDF sets is essentially unchanged when going from NLO to NNLO. 
Therefore, the arguments supporting the PDF4LHC recommendation [77] (and specifically 
its use in the computation of Higgs exclusion limits [76]) apply equally at NLO and NNLO. 
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Figure 37: The total cross-sections for W+, W~ and Z° production at the LHC 7 TeV and their 
ratios (j{W~^ + W~)/a{Z'^) and a{W^)/a{W~). Predictions are shown for the same PDF sets and 
values of as shown in Fig. |36l and compared to the ATLAS and CMS data summarized in Table [9l 
shown as a dashed band. Uncertainties shown are one sigma for absolute cross-sections, and 68% 
confidence levels for cross-section ratios. 
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Figure 38: Comparison of NNPDF2.1 NNLO sets with different values of the strong couphng, 
shown as ratios to reference set with as = 0.119 at Q^ — 10^ GeV^: gluon at small and large x 
(top), triplet at large x and singlet at small x (bottom). To improve readability PDF uncertainties 
are only shown for the as — 0.119 set. 

9 Accuracy of the NNLO PDF determination 

The NNLO PDF determination is based on the most accurate available theory. It is there- 
fore worth discussing the dependence of results on the main sources of uncertainty. As 
shown in detail in our previous studies Ref. [6|[62], the main factor which drives PDF 
uncertainties is the underlying dataset. Also important are the values of the QCD pa- 
rameters used in the PDF extraction, primarily the value of the strong coupling a^, but 
also of the quark masses: their impact is sometimes comparable to that of the data, in 
that the uncertainty due to their variation is similar to the PDF uncertainty when they 
are kept fixed [SJSSlElllSn] • Here we study both aspects: first, the dependence of PDFs 
on parameters (specifically a^), by repeating the NNPDF2.1 NNLO determination as the 
underlying parameters are varied, and then the dependence on the size of the dataset by 
repeating the NNPDF2.1 NNLO determination with various subsets of the global dataset. 
Theoretical uncertainties will not be studied here: as we argued in Sect. [71 at NNLO the 
uncertainty related to higher order corrections (as might be estimated by renormaliza- 
tion and factorization scale variation) is usually subdominant, as are those related to the 
treatment of heavy quarks [38] . 
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Figure 39: Same as Fig. [HSl but for NNPDF2.1 NNLO sets with different values of as{Mz). 
Results are shown as ratios to the NNPDF2.1 NNLO reference with as{Mz) = 0.119. 



9.1 Dependence on as (Mz) 

In order to determine the dependence of PDFs on the value of as we have repeated the 
NNPDF2.1 NNLO determination as a^ is varied: we provide sets with a^ (Mz) in the 
range from 0.114 to 0.124 in steps of 0.001, each including A'^rcp = 100 PDF replicas. 
Results for the gluon, the quark singlet and isospin triplet are displayed in Fig. \3E[ where 
the ratio of the central PDFs for each value of as to the default as (Mz) = 0.119 set is 
shown, and compared to the PDF uncertainty on the central set. Results are qualitatively 
similar to their NLO counterparts of Ref. [71]: the PDF which is most sensitive to the 
choice of value of a^ is the gluon, which is mostly determined by scaling violations. 

As an illustration of the phenomenological impact of the choice of ag , in Fig. [321 we 
present results for the NNLO Higgs cross-section from gluon-gluon fusion (same as Fig. [35]) 
computed using these PDF sets with varying ag, again normalized to the central prediction. 
The dependence on as of this observable is quite strong, both because it is quadratic in 
the gluon, and also because it starts at 0{a'^), and it undergoes NLO corrections which 
are as large as the LO contribution, and NNLO corrections which are about half of the 
LO (see Ref. [l9l[80] for detailed studies of the relative size of PDF and as uncertainties 
on this process). 

Finally, in order to study the dependence on the value of the heavy quark masses, we 
have also produced PDF sets with nic = 1.5, 1.6 and 1.7 GeV (in addition to the default 
rric = \/2 GeV), and with mi, = 4.25, 4.5, 5.0 and 5.25 GeV (in addition to the default 
TTife = 4.75 GeV). The dependence of PDFs on the heavy quark masses is similar to that 
observed at NLO and discussed in Ref. [8j. 



9.2 Dependence on the dataset 

We now turn to the dependence of PDFs and their uncertainties on the underlying dataset. 
This dependence is relevant in that one may think that even though a wider dataset 
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Experiment 


Global 


HERA-only 


DIS-only 


DIS+DY 


Gollider-only 


iVdat 


3357 


834 


2783 


3171 


1090 


Total 


1.16 


1.07 


1.15 


1.18 


1.02 


NMC-pd 


0.93 


[13.15] 


0.88 


0.94 


3.43 


NMC 


1.63 


1.91 


1.69 


1.69 


2.06 


SLAG 


1.01 


3.17 


0.97 


1.03 


1.23 


BCDMS 


1.32 


2.15 


1.28 


1.30 


2.22 


HERAI-AV 


1.10 


1.05 


1.09 


1.09 


1.06 


CHORUS 


1.12 


2.63 


1.08 


1.13 


1.74 


FLH108 


1.26 


1.32 


1.27 


1.26 


1.26 


NTVDMN 


0.49 


[60.51] 


0.45 


0.54 


[23.02] 


ZEUS-H2 


1.31 


1.21 


1.26 


1.28 


1.30 


ZEUSF2C 


0.88 


0.77 


0.86 


0.88 


0.75 


H1F2C 


1.46 


1.30 


1.47 


1.50 


1.24 


DYE605 


0.81 


[9.06] 


[6.86] 


0.82 


[1.34] 


DYE866 


1.32 


12.41 


2.70 


1.32 


[5.76] 


CDFWASY 


1.65 


7.71 


[13.94] 


1.64 


1.07 


CDFZRAP 


2.12 


3.74 


2.15 


1.91 


1.22 


DOZRAP 


0.67 


1.11 


0.67 


0.65 


0.61 


CDFR2KT 


0.74 


1.15 


0.99 


1.25 


0.64 


D0R2CON 


0.82 


1.28 


0.88 


1.03 


0.83 



Table 10: Quality of the fit for NNLO PDF sets based on datasets of varying size. The total 
number of data points is given in the first row, followed by the x^ normalized to the number of 
data points both for the total fitted set and for each of the individual experiments. The y^ values 
for experiments which are not fitted are also shown in square brackets. 

always carries more information, smaller datasets may be more consistent (from either 
the experimental or theoretical point of view), and thus perhaps more reliable. It is 
thus important to understand what is the price that one pays for this putative improved 
consistency. 

To this purpose, we have constructed four new PDF sets, each based on a subset of the 
full NNPDF2.1 NNLO dataset, but using exactly the same methodology. The possibility 
of obtaining reliable PDFs from datasets of widely varying size (more than a factor three, 
in our case) without having to modify any aspect of the methodology (and in particular 
without having to change the parametrization, see Ref. [88]) is an advantage of the NNPDF 
approach, since it allows a meaningful comparison of uncertainties. Each of these PDF 
sets is made available through the standard LHAPDF interface, and each so far as it goes 
is as good as the default one, the only difference between them being the smaller amount 
of experimental information that goes into them. 

In particular the fits that we discuss in this section are, in order of increasing com- 
plexity: 

• HERA data only. This determination exploits a maximally consistent set of data: the 
combined HERA-I inclusive data, the HI and ZEUS F^ data and the ZEUS HERA- 
II data. PDFs based on this dataset have also been determined and published by 
the HERAPDF group [28]. 

• Deep-inelastic scattering (DIS) only. This determination does not include hadron- 
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Figure 40: Distances between central values (left) and uncertainties (right) for PDFs in the 
HERA-only and defauh NNPDF2.1 NNLO fits. AU distances are computed from sets of A'^rcp = 100 
replicas. 
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Figure 41: Comparison of NNPDF2.1 NNLO singlet, total strangeness and isotriplet PDFs in the 
global and in the HERA-only fits. 



hadron data, which one may perhaps consider theoretically or experimentally less 
clean than lepton-hadron data. 

Deep-inelastic scattering and Drell-Yan (DIS+DY) only. This determination is in 
principle the only truly NNLO one, as it excludes jet data, for which only approxi- 
mate NNLO matrix elements are known. In order to better understand the relevance 
of approximate NNLO jet corrections, we will also construct a set which only differs 
from the default one by setting to zero the approximate NNLO terms in jet matrix 
elements. 

Lepton and hadron collider data only. This determination excludes fixed-target data, 
which are less clean, both because of the lower energy and because a sizable fraction 
of them (in particular, all the neutrino DIS data) are obtained using nuclear targets. 
This determination is of greater complexity than DIS+DY, despite having a smaller 
number of datapoints, because it also includes jet data. 



In each case, a set of N, 



100 PDF replicas has been constructed. The NLO coun- 
terparts of the DIS and DIS+DY PDF determinations were discussed in Ref. [6] and are 



rep 



56 



Distance between central values, Nrep^lOO 

E ' ^ ^ ^ ~^ ^ ^ 

a NNPDF2.1 NNLOREFvs. DIS-only 

T3 

V 

As 



/y\..:- 



I ^ ^ ^ ^ ■ -^ 

1e-05 0.0001 0.001 0.01 





12 




10 


,__, 


8 


0° 

a- 
D 


6 


■0 


4 




2 









1 


1e 



-T^, 



Distance between PDF uncertainties, Nrep^lOO 



NNPDF2.1 NNLO REF vs. DIS-only 



y-u':M:\ 



05 0.0001 0.001 0.01 0.1 



Figure 42: Distances between central values (left) and uncertainties (right) for PDFs in the DIS- 



only and default NNPDF2.1 NNLO fits. All distances are computed from sets of N^-^ 
replicas. 



100 



available from LHAPDF both for NNPDF2.0 and NNPDF2.1; the HERA-only NLO PDFs 
were briefly discussed in Ref . [89] ; collider-only PDFs are presented here for the first time. 

The total x^ ^^^ that of the individual experiments for each of these PDF fits are 
shown in Table ITOt along with the number of data in each reduced dataset, and compared 
to those of the default NNPDF2.1 fit. In each case, the x^ of experiments not included in 
the corresponding fit is also shown, in square brackets. 

We now turn to a discussion of these fits, and in particular a comparison between each 
of them, the default NNPDF2.1, and the fit with immediately greater complexity, in order 
to assess the impact of individual data. These comparisons are based on the computation 
of distances (defined as in Appendix A of Ref. [B]) between PDFs in the two sets which 
are being compared, bearing in mind that d ~ 1 means statistical equivalence (the data 
added have no impact on the given PDF), while 1 < d < 7 (for A'rep = 100 replicas) means 
statistical inequivalence but compatibility at the one sigma level (the data added do have 
an impact, but only at the one sigma level). Some of the pairs of PDFs with the largest 
distances will also be compared directly. 

The HERA-only PDF set is subject to the limitation that charged-current DIS data 
are enough to determine at most four independent linear combinations of quark PDFs (see 
e.g. Ref. |90]). hence strangeness in this set is entirely uncertain — in the HERAPDF [28] 
set an independent parametrization is only provided for the combinations d + s and d + s, 
but not separately for strangeness. Indeed, for this fit the quality of the fit to NuTeV 
dimuon data is extremely poor. This set also provides a poor description of all datasets 
which are sensitive to the singlet-triplet separation (such as fixed-target DIS and DY data) 
to the light sea decomposition (such as W production data) and, to a lesser extent, the 
valence-sea separation (such as neutrino data). The description of the jet data is also of 
marginal quality. 

The distances between uncertainties (see Fig. H0|) for this fit and the default are largest 
for the triplet, strangeness, sea asymmetry, and valence. The distance between central 
values remains large for triplet and strangeness, but it is yet larger for the singlet and 
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Figure 43: Distances between central values (left) and uncertainties (right) for PDFs in the HERA- 
only and DIS-only NNPDF2.1 NNLO fits. AU distances are computed from sets of N^-^p = 100 
replicas. 
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Figure 44: Comparison of NNPDF2.1 NNLO singlet, total strangeness and total valence PDFs in 
the DIS-only and in the HERA-only fits. 



gluon: the large shifts in strangeness and triplet are accompanied by a corresponding 
increase in their uncertainty, while the increase in the uncertainty of the singlet and 
gluon is more moderate, so the change in central value ends up being statistically more 
significant, as can be clearly seen in the direct PDF comparison of Fig. HTJ All PDFs 
remain compatible with those of the global fit at the level of 90% confidence level, but the 
deviation of singlet and gluon is greater than one sigma. Phenomenology based on this 
PDF set is necessarily very uncertain in any contribution which depends on strangeness, 
as cross-sections using HERA-only PDF sets have a theoretical uncertainty of order of 
the size of the strange contribution, which for instance for W and Z production at the 
LHC is of order 15-25% of the total cross-section [91]. This uncertainty cannot be reduced 
regardless of the accuracy of the HERA data that go into the PDF determination. 

The most severe problems of the HERA-only fit disappear in the DIS-only fit, which 
thanks to the presence of data with neutrino beams or deuterium targets, achieves reason- 
ably accurate fiavor decomposition: indeed, distances between strangeness, valence and 
light sea asymmetry determined in this fit and those of the global fit, shown in Fig. H2t 
are rather smaller than one sigma (though uncertainties are still significantly larger), and 
the singlet and gluon are now in near-perfect agreement with those of the global fit, with 
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Figure 45: Distances between central values (left) and uncertainties (right) for PDFs in the 
DIS+DY and defauft NNPDF2.1 NNLO fits. All distances are computed from sets of iV^cp = 100 
replicas. 



only the gluon showing a deviation at the half-sigma level in the large x region. The 
improvement in accuracy in the flavour decomposition over the HERA-only fit is clear 
both in the distances between the uncertainties in these two fits (see Fig. I43p . and directly 
comparing PDFs for which the improvement is most dramatic: singlet, strangeness, and 
valence (Fig. IH]) . The triplet distribution also shows a significant decrease in uncertainty, 
but around the valence peak it only agrees with that of the global fit at the 90% confidence 
level: this may suggest some tension between deuterium DIS and hadron collider data (as 
has been discussed elsewhere [71[T3]), though it could also be a statistical fluctuation. 

As a consequence of all this, the quality of the agreement of the DIS-only PDFs with 
all Drell-Yan data shown in Table ITUI (specificallv W and Z production data) remains poor 
(at the level of many units of x^)> thus showing that a DIS-only PDF fit is not adequate for 
precision hadron collider phenomenology. On the other hand, it is interesting to observe 
that the gluon distribution determined by DIS scaling violations is in good agreement 
with that of the global fit, even in the large x region where jet data have an impact on its 
uncertainty. Even though the gluon uncertainty at large x is rather smaller in the global 
fit, the central value agrees at the half sigma level, and indeed the quality of the fit to jet 
data provided by the DIS-only fit is quite good, though of course not as good as when 
these data are also fitted. It is interesting to observe that other PDF fits based on reduced 
datasets (such as HERAPDF or ABKM) do not seem [19] to provide an equally good fit 
to jet data, presumably because of their less flexible PDF parametrization. 

Based on this, we expect the DIS-I-DY fit to be quite close to the default global fit. 
This expectation is borne out by the plot of the distance between these two fits. Fig. H5l 
In fact, almost all PDFs in this pair of fits are statistically equivalent. In comparison to 
the DIS-only fit (Fig. H6]l . the addition of the Drell-Yan data has a significant impact on 
all individual quark flavours, reducing the uncertainties of the flavour decomposition, as 
can also be seen by directly comparing PDFs (Fig. H7|) . 

The only significant impact of jet data is on the gluon at large x whose central value 
shifts by about half a sigma with a corresponding reduction in uncertainty when going 
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Figure 46: Distances between central values (left) and uncertainties (right) for PDFs in the DIS- 
only and DIS+DY NNPDF2.1 NNLO fits. All distances are computed from sets of N^^p = 100 
replicas. 
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Figure 47: Comparison of NNPDF2.1 NNLO isotriplet, total valence and sea asymmetry PDFs 
in the DIS-only and in the DIS+DY fits. 



from the DIS+DY to the global fit, also leading to a slight shift of the singlet and valence 
distributions. 

Because of all this, one may wonder whether it might be advantageous to use the 
DIS+DY fit as the default global NNLO fit, given that the NNLO jet matrix elements are 
only approximate. To address this issue, we have produced a NNLO fit in which the jet 
matrix elements are computed at NLO, everything else being as in the NNLO global fit. 
Distances between this PDF set and the default are displayed in Fig. HSl These distances 
are all smaller than the distances of Fig. US] between the reference and DIS+DY fits. 
Hence, the impact of NNLO corrections to the matrix elements is negligible in comparison 
to the impact of the jet data (however moderate this be). If one reasonably assumes that 
the difference between exact and approximate NNLO matrix elements is smaller, or at 
least not much larger, than the distance between NLO and approximate NNLO, one must 
conclude that there is no advantage in downgrading to the DIS+DY determination. Thus, 
the fit with jets at approximate NNLO (i.e. the NNPDF2.1 NNLO default) provides the 
most accurate PDF determination. 

The last PDF determination that we consider is based on a fit to collider data only. 
To be precise, we use the same datasets as in the HERA-only fits supplemented by the 
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Figure 48: Distances between central values (left) and uncertainties (right) for PDFs in the default 
NNPDF2.1 NNLO fit when approximate NNLO jet matrix elements are switched off. All distances 
are computed from sets of iV^p = 100 replicas. 



Tevatron weak-boson production and inclusive jet production data. The quality of the 
global fit in this case, with x^ = 1-02, is significantly better than that of the default global 
fit: this supports the idea that a collider-only dataset might be more consistent than one 
which also includes fixed-target data. 

However, the distances between the collider-only and global PDFs (Fig. H9]) show that 
they are far from being equivalent: in fact almost all the PDFs show a shift at the one 
sigma level, with uncertainties significantly larger in the collider-only fit. Correspondingly, 
the quality of the fit of the collider-only PDFs to the fixed target data is generally poor, 
especially for deuterium data (such as NMC-pd or DYE866), which control the up-down 
separation, and the dimuon data (which control strangeness). These increased uncertain- 
ties end up contaminating also the singlet and gluon PDFs, as shown by the distance 
computation and by direct PDF comparison (Fig. [50]) . Hence, we must conclude that 
a collider-only fit at present does not provide competitive accuracy for phenomenology 
due to its extremely poor determination of strangeness and light quark asymmetry, and 
rather poor determination of valence, triplet and even singlet PDFs. This situation might 
improve with forthcoming HERA-H and LHC data, so the collider-only fit remains an 
appealing possibility for the future. 
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Figure 49: Distances between central values (left) and uncertainties (right) for PDFs in the 
collider-only and default NNPDF2.1 NNLO fits. All distances are computed from sets of iViop = 100 
replicas. 
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Figure 50: Comparison of NNPDF2.1 NNLO gluon, singlet and triplet PDFs in the collider-only 
and reference fits. 
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10 Conclusions 

The family of NNPDF2.1 PDF determinations — including the NLO set of Ref. [S] and the 
NNLO and LO sets presented here — provides arguably the most accurate available PDF 
determination: it is based on a wider dataset than ABKM09 [BS] or HERAPDF [2S], unlike 
CTIO [14] it also includes NNLO PDFs, and it is based on a more up-to-date dataset and 
treatment of uncertainties than MSTW08 [131 i^^ particular it uses the combined HERA-I 
data, and correlated uncertainties for all the experiments which supply them). Of course, 
this is in part due to it being the most recent PDF set, and updates from the other PDF 
fitting groups are to be expected in the future. However, the NNPDF set is unique in that 
it is based on a methodology which has been demonstrated I7l[62] to lead to PDFs and 
uncertainties which behave in a statistically consistent way and minimize parametrization 
bias. 

In Ref. 190] it was proposed that an ideal parton set should satisfy a set of general 
requirements: NNPDF2.1 now satisfies essentially all of them. Its only shortcoming is 
that, like all other parton sets to date, it does not include an estimate of the uncertainty 
related to the truncation of the perturbative expansion (as given for instance by perform- 
ing renormalization and factorization scale variation). However this is only a problem 
at NNLO, since at LO and NLO the availability of a higher-order set already provides 
this information. Furthermore there is every indication that at NNLO the theoretical 
uncertainty is small compared to the data uncertainties. 

With the most relevant methodological and conceptual issues now solved, the emphasis 
now shifts to making an optimal use of the data which have started to flow so copiously 
from the LHC experiments. 



Ah the NNPDF2.1 NNLO and LO PDF sets that have been discussed in this work are 
available from the NNPDF web site, 



http : //sophia . ecm . ub . es/nnpdf 



and through the LHAPDF interface [63] from v.5.8.6-beta onwards. The older NNPDF2.1 
NLO PDF sets (called simply NNPDF2.1 on LHAPDF), described in detail in Ref. [S], 
remain unchanged. 

Specifically, the new PDF sets that have been produced in the present analysis and 
are available in LHAPDF are the following: 

• The reference NNPDF2.1 NNLO sets, sets of A^rep = 100 and 1000 rephcas: 
NNPDF21_nnlo_100.LHgrid and NNPDF21_nnlo_1000.LHgrid. 

• NNPDF2.1 NNLO sets of A^cp = 100 replicas with a^ varied from 0.114 to 0.124 
with steps of 6as = 0.001: 

NNPDF21_nnlo_as_0114_100.LHgrid, . . ., NNPDF21_iinlo_as_0124_100.LHgrid. 
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NNPDF2.1 NNLO sets of iVrcp = 100 replicas with nic varied: 
NNPDF21_nnlo_mc_15_100 . LHgrid, NNPDF21_rinlo jic_16_100 . LHgrid, 
NNPDF21_nnlo_mc_17_100 . LHgrid. 

NNPDF2.1 NNLO sets of N^-cp = 100 replicas with rub varied: 
NNPDF21_niilo jib_425_100 . LHgrid, NNPDF21_nnlo jib_45_100 . LHgrid, 
NNPDF21_nnlo_mb_50_100 . LHgrid, NNPDF21_nnlo jib_525_100 . LHgrid. 

The NNPDF2.1 NNLO PDF sets based on reduced datasets: 
NNPDF21_dis_100 . LHgrid, NNPDF21_dis+dy_100 . LHgrid, 
NNPDF21Jieraonly_100 . LHgrid, NNPDF21_collider_100 . LHgrid. 

The reference NNPDF2.1 LO sets, sets of TVrep = 100, for alphas=0.119 and 0.130: 
NNPDF21_lo_as_0119_100. LHgrid and NNPDF21_lo_as_0130_100. LHgrid. 

The NNPDF2.1 LO* sets, without the momentum sum rule imposed, sets of Nj-^p = 

100, for alphas=0.119 and 0.130: 

NNPDF21_lostar_as_0119_100. LHgrid and NNPDF21_lostar_as_0130_100. LHgrid. 
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A Heavy quark coefficient functions to O (a^) in Melfin space 

In this Appendix we provide expressions for the Melhn transforms of the O (a^) massive 
heavy quark coefficient functions in the Q^ — )• oo hmit. These asymptotic coefficient 
functions were first computed long in Ref. [92] in j;-space. Their Melhn transforms have 
been given in Refs. J1031I104] . and will be rederived here in a form which is suitable for 
our purposes. 

In order to perform the Mellin transform of the x-space FFN heavy quark coefficient 
functions of Ref. |92] it is convenient to rewrite them in terms of independent Mellin 
integrals, which we can then tabulate. 

Following the notation introduced in Ref. [92j, we first consider the gluon coefficient 
function for Fl. The corresponding Mellin transform can be written as 



< (^' S' ^) = '^^ [crri^) + Cl,{N) In g - C^^iN) In ^ 



2 1 

■ (9) 



The coefficient function has been separated into three terms: a Q^-independent term 
^const^jY)^ a collinear log term C^{N), and a scale variation term C^{N). A similar 
decomposition will be performed for all coefficient functions. 
The individual terms are: 
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where the A J are the independent Mellin integrals 

4)=A„(iV + 0. (11) 

Those for which we will use closed-form analytic expressions are collected in Tables [11] 
and [T2J Some of these Mellin transforms were already computed in Ref. [53] and are 
repeated here for completeness. The remainder, for which we will use nymerical approxi- 
mations, are evaluated at the end of this appendix. 

The quark coefficient function for Fl can be similarly written as 
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Let us now turn to the F2 heavy quark coefficient functions. In comparison to the 
longitudinal structure function, there are extra pieces C'^^{N) and C^^{N) arising from 
the double collinear logarithm. The gluon coefficient function for F2 can be written as 
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52 4(0) 
3 ^1 



84(-l) 
3^4 



(20a) 
(20b) 



(20c) 
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Ci, 



,(0) 



Cf {4yl^^ + 4^^'^ + AA'^' + 4^, 



(1) 



,(0) 



4" 



2A 



(0) 



2A 



(1) 



+2A 



(0) 



2A 



(1) 



,(2) 



4 ^^4 3 ^4 

40/1(0) I 4 4(1) I 32 4(2)1 

"T"^i "^ 3"^i "T T"^i J' 



2A 



(0) 



+ |A, 



(-1) 



IO4I) + §42) + 4^ 



(-1) 
1 



a 



2,g 



(1) 



Cf [44"^ +44^^ + 14"^^ +24' 



.(-1) 



l(0) 



2A 



(1) 



l-4f'] 



The gluon radiation coefficient function for F2 can be written as 



ff(2) / AT '3 



m? 



4Tf 



/^const 
'-'2,GiJ 



iN) + Cl%^{N) In^ g + C«c^(Ar) In ^^ 



Q^ 



(20d) 



(20e) 



(21) 



where 



/^const 



Cf [-1{A 



.(0) , 4(2)^ 
^33 ^ ^33 V 

+1(24? -^g- 



'■SO 

.67/^(0) 

14(0) _ 23^(1) 
'2^2 



A 



(l)^ 



^(2A 



27 



A 



(0) 



, 4(0) , 4(2) 



+ 



A 



(0) 



U 



5 
359/94(0) _ 

108 '^^"^27 ^1 
I 29 4(0) _ 295 /i(l) 
"^ 36 "^1 36 ^l 



A 



18 
l(l) 



(2A 



,(0) 
*-32 



o (^;^9 ~r ^' 



28 



A 



(0) 



A 



(2)n 

32 / 

(1)^ 



14(0)^13/1(1) 



^1 'I + 6^4 + 6 ■ 



I 134C(2) 265] 
"•" 18 "•" 36 J ' 



^2,GR 



(0) 



CHi(2<,^-<^-4^^) + i], 



l(l)^ 



c: 



1(0) 



(1)^ 



2,GR 



L^F [3(^^28 ^4 ^4 ) 18V^^27 ^1 ^ 



,(0) 



1(0) 



,(1). 



4r4(0) I /l(2)^_L 14(0), 134(1) 8C(2) 
" 3 1^29 "T ^29 J"'"6l"'"6l 3 6 
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(22a) 

(22b) 
(22c) 



In addition to the elementary Mellin transforms hsted in Tables [TT]|12l the coefficient 
functions contain terms whose Mellin transform is known in closed form, but is expressed in 
terms of generalized harmonic sums Si-^^,,,^i„{N). We have evaluated the Mellin transform 
of these functions through suitable numerical approximations. The x-space expression 
whose Mellin transform will be evaluated in this way are the following: 



fieiz) = 2Li2(-z) ln(l + z) + ln(z) ln2(l + z) + 2S_ 



1,21 



/24(z) = Lis 



/22(^) = Li3(l - Z) 

l-z\ .. / 1 



Lis 



1 + zj V 1 + -Z 

/25(^)=ln(z)ln(l-z)ln(l + z), 

f26{z) =ln(l - z)U2{-z), 

/33(.)-^^^^'-^^ 



(23a) 

(23b) 

(23c) 

(23d) 
(23e) 

(23f) 



1-z 
We will now consider each of these functions in turn. 

In order to determine the Mellin transform of Eq. ()23ap . we use Eq. (32) of Ref. j93j 
which can be written as 



Au(N) = M[MMN) =^M i5M 



A^ l+Z 




(24) 
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fn{z) 



An{N)=M[,Uz)]{N) 



1 

N 



ln(z) 



1 



ln2(z) 



2 

iV3 



ln(l - z) 



SijN) 

N 



111^(1 -z) 



sUn) + S2{n) 

N 



ln(z)ln(l-z) 



Si(N) S2{N)-a2) 
N^ N 



Li2(l-z) 



S2{N)-C{2) 

N 



10 



Li2(-z) + ln(z)ln(l + z) 



2N AN 



..,^)-4f 



11 



ln3(z) 



6 

'n* 



12 



ln{z)\n{l-z) 



2 

N 



m^m-m-m-s,im 



N 



iV2 



N 



13 



ln(l-z)Li2(l-z)-Li3(l-z) 



Si{N)S2{N) - C(2)5i(JV) + S^jN) - C(3) 



14 



ln(z)ln^(l-z) 



N 



^ ^(3) + C(2)5i(7V) - ^ [5?(7V) + S2{N)] 



-SiiN)S2iN) ^ SsiN) 



15 



^1,2(1 -;^) 



--[53(7V)-C(3)] 



17 



ln(l + z) 



ln(2) 1 

N 2N 



Si 



N - 1 



Si 



N 



18 



In^(l-z) 



SfjN) + 3Si{N)S2{N) + 253 W 

N 



19 



Li3(-^) 



3C(3) , C(2) ln(2) 1 



4iV 2A^2 ^3 2A^3 



«'l^)-4f 



Table 11: Elementary Mellin transforms. 



69 



n 


In{z) 


A„(iV) = M[/(z)](7V) 


20 


Hz)U2{-z) 


1 

2iV2 

1 

^2iV2 


n'^ [ 2 j ' 2'^ [ 2 , 
2 /A^\ 1 /iV\ 
T7^i y +^^2 y +C(2) 
N \2 J 2 V2/ 


1 , 41n(2) 

' N _ 




21 


\n^{z)\n{l + z) 


1 

2iV 
1 

~27^ 


[m^'\ 2 j ' iV^^l, 2 j + 2^M 
7 7V2^U 2 J + 7V^n2 J + 2^M 2J 


'n-1\ 41n(2) 
2 ; iV2 




23 


ln(z)Li2(l-z) 


^[52(iV)-C(2)] + |[53(iV)-C(3)] 


27 




-5i(7V-l) 


28 


/ln(l-z)\ 

V 1-^ )+ 


i52(7v_i)+l52(7v_i) 


29 


ln(z) 
1-z 


52(7V-1)-C(2) 


30 


/ln2(l-^)\ 


-\sl{N - 1) ~ 5i(7V - l)^2(iV ~ 1) - ^-S^{N - 1) 


31 


ln2(z) 
1-z 


-2[53(iV - 1) - C(3)] 


32 


ln(z)ln(l-x) 
1-z 


C(3) + C(2)5i(iV - 1) - 5i(iV - l)52(iV - 1) - S:,{N + 1-1) 



Table 12: Continuation of Table [TTl 
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The last two Mellin transforms are respectiveljo: 



M 



and 



M 



l + z 



l + z 



S. 



S^ 



N-1 
2 

N-1 



s,,^ 



SA^ 



(26) 
(27) 



The MeUin transform of the term involving the special function $(z) is given in Ref. 

in terms of the generalized harmonic sum 5i^2(-^)- To avoid evaluating this directly for 

complex A^, we instead rewrite 



-^^Y^a.z'^z), 
^ + ^ k=i 



(28) 



where the values of the coefficients a^ are determined by fitting the polynomial ^^^=1 akZ 
to the function z/{l + z) on the unit interval. One can then use the Mellin transform of 
the function z $(2;), which reads 



M[z'=$(z) 



1 1 

+ 



(N + k)'^ 2{N + k) 



So 



N + k-1 



So 



N + k 



(29) 



The desired Mellin transform Aiq{N) is immediately found combining Eqs. ()26p . ([2 
and dMD- 
In order to determine the Mellin transform of Eq. ()23bp we use the expansion 



00 u 



Li3(^) = Ep- 



fc=i 



(30) 



It follows that 



00 „i 

A,,[N) = m[h2{z)\{N) = E p / ^"^^'(1 

k=i -^0 



zfdz, 



but 



so that 



\"-(i-.)'d.-r(A')r(.- + i) 



k\ 







A22{N) = Y, 



T{N + k + l) {N + k)...{N + l)N 
k\ 



' k'\N + k)...{N + 1)N' 



^In general 



M 



2ln"(z) 
l + z 



(-1)"+V! 



S„+i(^— J-s^+if- 



lsi) 

(32) 
(33) 

(25) 



This Mellin transform is superficially dilTerent from the Mellin transform number 15 in the appendix of 
Ref. [93], but they turn out to be equivalent after suitable simplification. This apparent difference is also 
responsible for the mismatch between entries 13, 28, 43, and 62 in the Table in the appendix of Ref. [53] 
and entries 17, 21, 20 and 19 in Tables [TT] and \T2l respectively. For the same reason entries number 4 and 
57 of 93_ look different from our Eqs. (p6)l - (p9|l . 
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In our implementation we have truncated this series at k = 30. 

Next, we turn to the Mehin transform of Eq. (|23c|) . In this case we fit the function 



10 



5(z) = (1-^)^-15^0,. 



A;=0 



to the function /24(-z)- However, one can show that 

/24(0) = Li3(l) - Li3(-1) = |C(3) = CO, 
so that Co is fixed, and we only have to fit 



10 



giz) = il-z)'~'[lm + Y^ 



CkZ 



(34) 



(35) 



(36) 



fc=i 



The Mellin transform of Eq. (j23cp follows immediately, because 

^10 



^24(iV)=M[/24(z)](iV) =Efc=oCfc/o^ 



10 



r(iv)r(6) ^ (N + k)...iN + i)N 



(37) 



r(iv + 6) ^ ''{N + h + l + k)...{N + b + iy 



In order to determine the Mellin transform of Eq. ()23dp . we use the representation 
given as Eq. (21) of Ref. j94j . in which the function ln(l + z) is approximated by the 
polynomial 



ln(l + z)^^4^^ 
fc=i 

Using also Table [TTl we then get 

8 
A25(iV) = M[/25(^)](iV) = Y. dkm^'' l^(^) 1^(1 - ^)] 



(38) 



k=l 



(39) 



5^4 



fe=l 



Si{N + k) , S2{N + k)-C{2) 



+ 



{N + kf ' N + k 

In order to determine the Mellin transform of Eq. ()23ep we use the expansion 

{-ifz^ 



Li2(-z) = j; 



k=l 



A;2 



so 



A;2 N + k ' 



A,,{N) = m[h,{z)\{N) = Y, ^M[z^ ln(l -z)\ = Y 

k=l k=l 

In this case we have chosen to truncate the series sX k = 100. 



(40) 



(41) 
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X 


, , /(-(n.,0),2\ 


^rel [C'Z''^') 




^rel [Ctf^') 


IQ-'' 


7 X 10-12 


4 X 10-11 


2 X 10-1" 


7 X 10-12 


10-*^ 


3 X 10-11 


1 X 10-11 


4 X 10-11 


3 X 10-12 


10-5 


2 X 10-12 


3 X 10-11 


4 X 10-13 


2 X 10-12 


10-4 


6 X 10-1° 


1 X 10-11 


3 X 10-11 


4 X 10-12 


10-3 


6 X 10-9 


2 X 10-12 


1 X 10-1° 


2 X 10-11 


10-2 


7 X 10-^ 


7 X 10-12 


2 X 10-1° 


1 X 10-11 


10-1 


1 X lO-'^ 


4 X 10-11 


1 X 10-11 


5 X 10-13 


3 X 10-1 


9 X lO-'^ 


1 X 10-11 


1 X 10-11 


5 X 10-13 


5 X 10-1 


4 X 10-6 


3 X 10-11 


8 X 10-12 


1 X 10-13 


7x 10-1 


1 X 10-5 


9 X 10-11 


6 X 10-12 


5 X 10-13 


9 X 10-1 


1 X 10-5 


6 X 10-8 


7 X 10-8 


1 X 10-8 



Table 13: Comparison of the inverse Mellin transforms of coefficient functions computed here 
to the original x-space expressions of Ref. [HI]: the percentage difference between the original 
expression and the numerical Mellin inverse is shown in each case. 

Finally, we determine the Mellin transform of Eq. ()23fp . In this case we use the 
geometric series for 1/(1 — z) so that 



As3{N)M [/33(z)] (N) = Y, M[z'U2il - z)] 



k=0 



E 

fc=0 



S2iN + k)-C{2) 
N + k 



(42) 



As in the case above, the series is truncated at A: = 100. 

Note that the Mellin transform of the x-space coefficient functions involve terms of the 
form z^fn{z). The Mellin transform M[z'/„(z)](A^) can be obtained from NL[fn{z)]{N) 
using the identity M[z'- fn{z)]{N) = M[fn{z)]{N + 1) . Thus the Mellin transform of any of 
the terms in Tables [TT1I12I and also any of the terms Eqs. ()23a|) - ()23fp multiplied by a factor 
z' can be obtained replacing N with N + I. 

In conclusion, we have checked our calculation by comparing the inverse Mellin trans- 
form of the A^-space coefficients with the original x-space results. We show in Table [13] 
the accuracy of this comparison for the various coefficient functions. We find excellent 
accuracy for all coefficients and all values of x. 
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Impact of gluon radiation in F2ligiit FONLL-C 



Impact of gluon radiation in FLIight FONLL-C 



Figure 51: Percentage difference between tfie NNLO ligfit quarlc structure functions F2 (left) 
and Fl (right) with and without gluon radiation contributions. From bottom to top the points 
correspond to scales of Q^=4, 10 and 100 GeV'^ respectively. 



B FastKernel implementation of FONLL-C 

In this Appendix we discuss the implementation and benchmarking of the FONLL-C 
neutral current structure functions in the FastKernel framework. At the same time we 
have implemented and benchmarked FONLL-B, since we expect to perform fits using this 
scheme in the future. 

The implementation of FONLL-B and FONLL-C structure functions in FastKernel 
requires the O (a^) massive heavy quark coefficient functions as well as their asymptotic 
Q^ — )• 00 limit in Mellin space. Li Ref. [8] we presented analytic results for the Mellin space 
O (as) heavy quark coefficient functions for neutral current and charged current scattering 
(see also Ref. [95j for the latter). For the O (q^) heavy quark coefficient functions in Mellin 
space we use the parametrization of Ref. [96j . The Mellin transforms in the asymptotic 
limit as Q^ — ?• 00 have been determined as in Appendix lAl 

The O (a^) massless coefficient functions were first computed in Refs. [97fllUU| . For 
the implementation in the FastKernel framework we have used the fast Mellin space 
parametrizations of the exact coefficient functions as given in Refs. |10HI102] . 

The gluon radiation terms, namely contributions with heavy quarks in the final state 
but where the struck quark is light, have to be treated with care. As discussed in Ref. [12], 
the gluon radiation contribution (which first appears at O (a^)) is part of the light quark 
structure functions. We have checked (see Fig. [5T]) that the size of these terms is very 
small both for the F2 and Fl structure functions, typically below 1%. 

Now we turn to the implementation and benchmarking of these results in the FastK- 
ernel framework. Benchmarking has been performed by comparing the FastKernel results 
with the FONLLdis code |12) . an x-space code that implements all FONLL schemes. 
In Table [TH we show the results of the benchmark comparison for the F2c{x,Q'^) and 
Flc{x,Q'^) structure functions in the massive fixed-fiavour number scheme at O (a^) for 
the FONLLdis code and for the FastKernel code. Results are provided at the reference 
points in the {x,Q^) plane and with the settings of the Les Houches heavy quark bench- 
marks |48) . The accuracy is given as the percentage difference between the FastKernel and 
FONLLdis calculations. The accuracy is never worse than 1%, which is amply sufficient 
for our purposes. The accuracy of Table [HI is a little worse than that of its O (as) counter- 
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part, shown as Tables 14-15 of Ref. [8]. This may be due to the fact that the Melhn space 
parametrizations of the heavy quark coefficient functions that we use [96J are in turn based 
on a parametrization of the exact x-space coefficient functions, while FONLLdis uses the 
original exact coefficient functions of Ref |lU5j . This loss of accuracy is negligible for the 
needs of current phenomenology, but more detailed studies of this issue may be needed 
in the future when final combined HERA heavy quark structure function data become 
available. 

The same comparisons for the FONLL-B and FONLL-C are presented in Tables [15] 
and [16] respectively. Comparable accuracy is achieved for these two GM-VFN schemes, 
again sufficient for our purposes. 





F2c FFNS 


Flc FFNS 


X 


FONLLdis 


FastKernel 


Accuracy (%) 


FONLLdis 


FastKernel 


Accuracy (%) 


Q^ = 4 GeV^ 


10-^ 


0.24591 


0.24244 


1.43 


0.02215 


0.02184 


1.41 


10-4 


0.13658 


0.13481 


1.31 


0.01306 


0.01283 


1.75 


10-3 


0.06384 


0.06308 


1.20 


0.00662 


0.00653 


1.41 


10-2 


0.02025 


0.02007 


0.92 


0.00238 


0.00237 


0.38 


Q2 = 10 GeV^ 


10-5 


0.53701 


0.53904 


0.38 


0.08031 


0.08105 


0.91 


10-4 


0.29558 


0.29550 


0.03 


0.04611 


0.04579 


0.71 


10-3 


0.13909 


0.13852 


0.24 


0.02273 


0.02254 


0.86 


10-2 


0.04689 


0.04664 


0.10 


0.00832 


0.00826 


0.70 


g2 = 100 GeV^ 


10-^ 


1.99594 


2.00744 


0.57 


0.44200 


0.43976 


0.51 


10-4 


1.00912 


1.01479 


0.56 


0.22148 


0.21880 


1.22 


10-3 


0.43527 


0.43410 


0.27 


0.09487 


0.09380 


1.14 


10-2 


0.13574 


0.13492 


0.61 


0.03019 


0.03002 


0.57 



Table 14: Benchmark comparisons for the F2c{x,Q^) and Flc{x,Q^) structure functions in the 
FFN scheme at 0{a'^) obtained using the FONLLdis code [E] and the FastKernel code. Results are 
provided at the benchmark kinematical points in x, Q^ and with the settings of the Les Houches 
heavy quark benchmarks [48j . The accuracy is given as the percentage difference between the 
FastKernel and FONLLdis [T^ calculations. 
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F2c FONLL-B 


Flc FONLL-B 


X 


FONLLdis 


FastKernel 


Accuracy (%) 


FONLLdis 


FastKernel 


Accuracy (%) 


Q^ = 4 GeV^ 


10-^ 


0.24787 


0.24858 


0.29 


0.02519 


0.02524 


0.21 


10-^ 


0.13556 


0.13598 


0.31 


0.01435 


0.01435 


0.01 


10-3 


0.06360 


0.06350 


0.15 


0.00718 


0.00715 


0.34 


10-^ 


0.02062 


0.02051 


0.52 


0.00258 


0.00258 


0.08 


Q2 = 10 GeV2 


10-^ 


0.55100 


0.55088 


0.02 


0.09637 


0.09679 


0.43 


10-4 


0.30114 


0.30150 


0.14 


0.05229 


0.05222 


0.14 


10-3 


0.14371 


0.14375 


0.02 


0.02507 


0.02499 


0.32 


10-2 


0.05012 


0.05015 


0.01 


0.00908 


0.00907 


0.06 


Q'^ = 100 GeV^ 


10"^ 


2.10034 


2.08834 


0.57 


0.48769 


0.48716 


0.11 


10-4 


1.04510 


1.05096 


0.56 


0.23569 


0.23418 


0.65 


10-3 


0.45879 


0.45916 


0.08 


0.09923 


0.09888 


0.35 


10-2 


0.15039 


0.15030 


0.06 


0.03170 


0.03174 


0.15 



Table 15: Same as Table [H for the FONLL-B GM-VFN scheme. 





F2c FONLL-C 


Flc FONLL-C 


X 


FONLLdis 


FastKernel 


Accuracy (%) 


FONLLdis 


FastKernel 


Accuracy (%) 


Q^ = 4 GeV^ 


10-5 


0.27830 


0.28163 


1.18 


0.02468 


0.02500 


1.30 


10-4 


0.14709 


0.14858 


1.00 


0.01423 


0.01441 


1.23 


10-3 


0.06556 


0.06591 


0.52 


0.00733 


0.00735 


0.24 


10-2 


0.02034 


0.02034 


0.00 


0.00281 


0.00283 


0.74 


Q2 = 10 GeV2 


10-^ 


0.69412 


0.69873 


0.66 


0.09909 


0.10062 


1.52 


10-4 


0.34662 


0.34911 


0.88 


0.05520 


0.05550 


0.52 


10-3 


0.15025 


0.15114 


0.32 


0.02682 


0.02699 


0.63 


10-2 


0.04986 


0.05022 


0.13 


0.01002 


0.01008 


0.58 


g2 = 100 GeV2 


10-^ 


2.36920 


2.37887 


0.41 


0.47822 


0.47994 


0.36 


10-4 


1.12695 


1.13870 


1.03 


0.23916 


0.23914 


0.01 


10-3 


0.47058 


0.47317 


0.55 


0.10262 


0.10293 


0.30 


10-2 


0.15175 


0.15236 


0.40 


0.03312 


0.03327 


0.47 



Table 16: Same as Table [H for the FONLL-C GM-VFN scheme. 
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X 


Erel [Uy) 


Crel {xdy) 


ercl(-^-) 


ercl {L+) 


ercl {s+) 


ercl {g) 


10-7 


2.510-4 


2.510-4 


3.410-5 


2.710-5 


4.710-5 


5.710-5 


10-6 


1.510-4 


1.010-4 


1.010-4 


6.110-5 


5.010-5 


5.810-5 


10-5 


1.710-4 


1.610-4 


1.110-4 


5.210-5 


3.610-5 


7.810-5 


10-^ 


1.610-4 


1.710-4 


6.510-5 


4.310-5 


7.710-5 


8.610-5 


10-3 


9.410-5 


6.910-5 


1.910-5 


7.610-5 


7.510-5 


1.010-4 


10-2 


2.910-4 


3.210-4 


4.510-4 


1.010-4 


1.310-4 


1.210-4 


10-1 


2.610-4 


3.810-4 


5.910-4 


5.210-4 


3.410-4 


7.510-5 


310-1 


1.310-5 


1.310-5 


4.710-5 


1.310-5 


9.910-5 


4.810-5 


510-1 


3.810-5 


5.810-5 


3.110-5 


4.210-4 


8.510-4 


2.310-4 


710-1 


1.810-4 


7.910-5 


2.410-3 


4.210-3 


9.410-3 


1.110-3 


910-1 


3.910-3 


1.310-2 


2.210-1 


2.310-1 


5.610-1 


1.910-1 



Table 17: Relative accuracy of NNLO PDF evolution in the ZM-VFNS as implemented in the 
FastKernel framework in comparison to the Les Houches benchmark tables flUSj . 

C Benchmarking the NNLO PDF evolution 

In this appendix we benchmark the accuracy of the NNLO PDF evolution as imple- 
mented in the FastKernel framework. The NNLO anomalous dimensions for DGLAP 
evolution were computed in Refs. [106lll07j . In our code we use the accurate Mellin space 
parametrizations also provided in Refs. |106yi07j . 

The accuracy of our NNLO PDF evolution code, based on the FastKernel framework, 
has been cross-checked against the Les Houches PDF evolution benchmark tables [108). 
These were obtained comparing the HOPPET [109] x-space and PEGASUS [llOj A^-space 
evolution codes. In order to perform a meaningful comparison, we use the so-called iterated 
solution of the A^— space evolution equations, and use the same initial PDFs and same 
running coupling, following the procedure described in detail in Ref. [108]. 

In Table [T71 we show the relative difference Ci-ei between our results and the benchmark 
tables of Refs. [108] , for various combinations of PDFs at NLO in the zero-mass variable 
flavour number scheme. We see that the accuracy is perfectly satisfactory for precision 
phenomenology. Comparable accuracy is obtained in a fixed-flavour number scheme. Fi- 
nally, by benchmarking against PEGASUS, we have also checked that comparable accuracy 
is obtained if the truncated solution of the evolution equations is used, instead of the 
iterated one. 
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